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CHAPTER  I V 
SAMPLING 

I 

In  the  great  bulk  of  man's  activities,  one  Is  forced  to  work  with 
imperfect  knowledge;  we  are  so  often  ignorant  of  many  of  the  key  facts 
which  would  allow  us  to  plan  successfully  and  act  with  assurance.  Our 
mastery  of  some  aspects  of  the  physical  world  leads  us  to  forget  how  limit** 
ed  and  how  provisional  our  understanding  of  most  of  the  world  really  is. 

And  our  statements  about  other  human  beings  arc  often  little  more  than 
wild  guesses  and  pronouncements  drawn  from  tradition. 

One  of  the  most  common  sources  of  man's  difficulties  in  knowing 
about  people,  things,  and  events  is  that  he  is  almost  always  dealing  with 
a small  fraction  of  the  world  that  concerns  him.  He  seldom  assesses  the 
whole-'he  usually  must  rely  on  a part,  a sample.  "In  his  continuing 
effort  to  adjust  to  the  environment,"  note  Professors  Mueller  and  Schuessler 
it  the  first  selection  In  this  chapter,  "man  has  necessarily  been  obliged  to 
treat  an  isolated  experience  as  typical  of  a larger  system  In  order  to  profit 
by  t», " 


The  problem,  of  course,  is  how  to  select  an  isolated  experience, 
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a sample  of  people,  things,  or  events,  so  that  it  best  represents  the  universe 
from  which  it  is  drawn.  Furthermore,  there  are  the  problems  of  measuring 
just  how  well  we  have  performed  this  task  and  of  making  the  best  estimates  we 
can  about  the  sampled  universe  on  the  basis  of  the  sample  that  is  in  our  hands. 

Statistical  theory,  based  on  mathematical  laws  of  probability,  provides 
some  solutions  to  these  problems.  The  use  of  random  samples,  purposive 
samples,  stratified  samples,  cluster  samples,  and  so  on,  as  described  by 
Mueller  and  Schuessler,  are  all  techniques  by  which  we  can  make  statements 
about  units  in  a population  which  we  have  never  observed  first  hand—and 
attach  a quantitative  estimate  of  our  accuracy. 

1 1 

If  ideas  about  probability  and  sampling  are  a commonplace  In  scien- 
tific inquiry  (and,  Indeed  in  everyday  human  affairs,  although  often  Implicit 
or  unrecognized),  it  is  also  true  that  they  have  not  been  readily  accepted  by 
the  law.  There  are  a number  of  cases,  of  course,  in  which  data  derived 
from  samples  have  been  presented  in  trial  proceedings.  And  material  drAwn 
from  the  United  States  Census  (which  is  heavily  dependent  cn  sampling  pro- 
cedures) has  usually  been  allowed  Intc  the  courts,  as  pcinted  out  by  Professor 
Zelsel  in  ’’The  Uniqueness  of  Survey  L.  teiu  \ ' But,  In  general,  the  law  has 
been  reluctant  to  rely  on  such  facts  and  remains  more  than  a little  suspicious 
of  the  statistician  and  his  talk  of  deviations,  levels  of  significance,  etc. 
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This  suspicion  on  the  part  of  the  law  may  rest  partially  on  a rather 
widespread  antipathy  toward  quantitative  modes  of  reasoning.  Our  legal 
system,  after  all,  was  developed  at  a time  when  few  lawyers  or  judges  or 
juries  could  have  been  expected  to  be  very  familiar  with  the  mathematical 
concepts  of  statistics;  and  even  today  many  people  find  the  logic  and  the 
system  of  notation  a serious  barrier.  It  is  also  possible  that  lawyers  and 
judges  view  theories  of  probability  and  the  data  associated  with  them  as  very 
suspicious  objects,  following  the  general  line  of  thought  that  "there  are  lies, 
thei'e  are  damn  lies,  and  there  are  statistics.  " And  then  there  is  the  pos- 
sibility that  the  intellectual  training  of  Judges  and  lawyers  directs  their 
Interest  to  the  specific,  the  particular  case,  the  unique  event.  Probability 
and  sampling,  however,  involves  classes  of  events,  tendencies,  the  average 
behavior  of  groups  of  units  rather  than  the  actual  behavior  < f anv  one  unit; 
and  the  law  may  feel  most  uncomfortable  when  it  is  asked  t'  inove  from  the 
concrete  reality  of  the  individual  to  the  abstract  characteristics  of  a popula- 
tion as  measured  by  a handful  of  examples. 

In  any  event,  whatever  may  be  ’he  general  source  of  the  law's  in- 
tellectual and  altitudinal  reluctance  to  e srace  the  cold  charms  of  statistics, 
there  have  been  immediate  difficulties  'erms  of  the  rule  s * evidence.  We 
think  that  Professor  Zeisel  explores  many  of  these  Issues.  First,  there  is 
the  problem  of  hearsay,  since  a good  deal  of  the  evidence  involved  is 
frequently  drawn  from  surveys  depending  on  verbal  responses  to  questionnaires 
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and  interviews  administered  to  large  numbers  of  people.  Second,  there 
is  the  problem  of  a representative  sample,  since  this  sort  of  evidence  is 
usually  not  based  on  the  responses  of  every  person  in  the  population  under 
examination.  We  think  that  Professor  Zeisel  is  right  when  he  says,  "The 
law  with  respect  to  survey  evidence  is  still  far  from  settled  doctrine. 

Thus  far,  the  development  has  been  guided  by  fears  that,  since  most  burveys 
are  hearsay  evidence,  a bad  one  might  too  easily  mislead  the  trier  of  facts. 
But  , . , while  the  dangers  of  an  uncritically  received  survey  are  real 
enough,  they  derive  not  from  its  hearsay  character,  but  primarily  from 
elements  easily  opened  to  expert  review, M Presumably,  this  expert 
review  can  be  provided,  Just  as  lawyers  and  judges  can  become  more 
aware  of  the  relevance  of  data  based  on  sampling.  And  when  that  happens 
the  "orderly  development  of  the  law, " placing  a greater  stress  on  such 
evidence,  may  well  become  a reality. 

1 1 1 

The  problems  of  using  data  derived  from  sampV-  re  not  confined 
to  evidence  for  legal  disputes,  of  coui'se.  Such  data  play  an  inJispensable 
part  in  almost  all  social  science  research,  since  the  limitations  of  time 
and  money  mean  that  we  usually  cannot  examine  the  entire  population  which 
we  wish  to  describe  and  analyse.  And  we  must  be  Just  as  careful  in  con- 
fronting the  problems  connected  with  hearsay  and  representativeness  in 
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research  as  in  litigation. 


The  selection  from  Automobile  Accident  Costs  and  Pay- 
ments is  most  important,  at  this  point,  as  an  illustration  of  how  detailed 
and  explicit  the  person  who  engages  in  research  must  be,  if  other  people 
arc  to  have  adequate  information  to  Judge  whether  the  sampling  design  is 
sound  and  whether  the  final  sample  is  likely  to  be  representative  enough  to 
justify  generalizations.  The  authors  of  this  study  are  careful  to  point  out 
the  problems  connected  with  insurance  company  records  and  hospital 
records,  in  determining  the  universe  under  examinatlon--all  individuals 
injured  or  M’led  in  automobite  accidents  that  occurred  in  the  state  of 
Michigan  during  one  calendar  year.  They  note  how  police  records  also 
have  their  faults,  but  are  probably  the  best  source  of  data  available  when 
supplemented  by  court  records.  The  authors  tell  us  how  many  question- 
naires in  the  sample  were  not  completed  and  why.  All  of  these  issues  and 
more  are  made  painstakingly  clear  so  that  we  have  a basis  for  deciding  if 
the  cample  can  be  considered  a random  one  and  if  distorting  errors  have 
crept  in  as  the  study  moves  from  a list  of  names  on  a piece  of  paper  to  a 
set  of  completed  questionnaires  and  Interviews.  If  we  do  not  have  this  kind 
of  information,  we  arc  left  In  the  position  of  simply  accepting  or  rejecting 
the  author's  conclusion  on  the  basis  of  intuition,  faith  in  authority,  etc. 


* 5 - 


I V 

The  last  selection  in  this  chapter  "Election  Survey  Procedures  of 
the  Gallup  Poll,  " by  Paul  Perry,  is  of  interest  because  the  problem  it 
tackles  is  a vex*y  common  one  in  designing  social  science  research.  Fre- 
quently, there  is  no  master  list  of  people,  things,  or  events  forming  the 
universe  to  be  investigated.  Or,  if  such  lists  do  exist,  they  are  frequently 
out-of-date,  seriously  incomplete,  or  otherwise  defective. 

Selecting  a random  sample  from  an  existing  list  or  file  of  all  the 

units  in  the  universe  being  studied  is  relatively  simple,  since  we  can  use  a 

1 

table  of  random  numbers  or  take  every  nth  unit.  When  no  such  list  or 

- 

See,  for  example,  John  G.  Peatman,  Introducti on  ^o  Applied 
Statistics,  (New  York:  Harper  and  Row,  1963),  pp.  424-425. 

file  is  available,  however,  the  problem  of  sampling  grows  more  difficult 
and  this  is  typically  the  case  when  we  want  to  generalize  to  an  entire  com- 
munity, to  a segment  of  the  population  of  the  United  States,  or  (as  in  the 
case  of  the  Gallup  Poll  dealing  with  elections)  to  the  voting  population  of  the 
United  States  as  a whole.  The  procedures  us^d  in  such  cases  vary  a good 
deal,  but  they  often  involve  some  form  of  the  basic  process  underlying  the 
procedure  descr.bed  by  Paul  Perrys 


/ / First,  the  division  of  the  universe  to  be  studied  into  a 


set  of  groups,  geographical  areas,  etc. , of  smaller  size. 

/ / Second,  the  selection  of  a random  sample  of  these 

smaller  segments  of  the  universe. 

/____/  Third,  the  selection  of  a random  sample  from  each  of 
the  segments. 

Essentially,  all  that  is  being  done  in  this  multi-stage  sampling 
is  to  divide  the  universe  into  more  manageable  sections  and  then  to  proceed 
with  the  business  of  selecting  a random  sample~by  first  randomly  selecting 
sections  and  then  randomly  selecting  units  within  these  smaller  and  more  con- 
venient subdivisions.  The  important  thing  is  to  maintain  the  basic  idea  of 
random  sampling — namely,  that  every  unit  in  the  universe  has  an  equal 
probability  of  being  selected.  If  the  number  of  units  in  each  subdivision  is 
not  the  same,  we  must  take  this  fact  into  account. 

The  problem  here  is  that  if  subdivisions  containing  different  numbers 
of  units  are  given  the  same  probability  of  being  selected,  and  if  the  same 
number  of  units  is  selected  from  each  subdivision,  we  end  up  with  a situa- 
tion in  which  each  unit  no  longer  has  an  equal  probability  of  appearing  in 
the  sample.  (The  chance  of  a unit  appearing  in  the  sample  equals  the 
probability  of  its  subdivision  being  selected,  multiplied  by  the  probability 
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that  It  will  be  selected  from  within  the  subdivision, ) 


There  are  two  ways  to  solve  this  rather  common  difficulty: 

(1)  We  can  give  each  subdivision  an  equal  chance  of  being  selected  and  then 
randomly  select  the  same  percentage  of  units  from  each  subdivision;  or 

(2)  we  can  weight  each  subdivision  by  the  number  of  units  it  contains, 
making  the  chances  of  a subdivision  being  selected  proportionate  to  its 
size,  and  then  randomly  select  the  same  number  of  units  from  each  sub- 
division. 

The  main  point,  however,  is  simply  this:  Statistical  data  concerning 
social  behavior  is  becoming  of  ever  greater  importance  in  all  aspects  of 
human  activity,  including  making  and  applying  the  law.  If  the  problems  we 
encounter  in  this  process  are  to  be  handled  intelligently  and  justly,  there 
must  be  an  informed,  cooperative  effort  among  a variety  of  disciplines. 
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UNIVERSE  AND  SAMPLE  I 


The  necessity  of  sampling  in  the  social  sciences  is,  we 
feel,  made  obvious  by  Mueller's  and  Schuessler's  reference  to 
a buyer  sampling  one  peach  from  a bushel  before  making  his  pur- 
chase. It  is  not  possible  for  him  to  test  each  piece  of  fruit  and 
therefore  he  relies  on  a sample.  The  same  is  true  for  those  who 
would  inquire  into  human  behavior.  It  is  usually  not  possible  to 
interview  every  student  at  a university,  for  example,  so  the  re- 
searcher must  also  rely  on  a sample.  There  is,  however,  a basic 
difference  between  the  kind  of  sampling  which  a buyer  of  peaches 
engages  in,  and  the  sampling  procedures  used  in  social  science 
research.  Scientific  sampling  is  a body  of  technical  procedures 
which  is  distinguishable  by  (1)  its  emphasis  on  a well-defined 
universe,  and  (2)  a random  selection  of  cases  from  that  universe. 

In  statistical  language  the  term  universe  refers  to  a 
totality  of  units  possessing  a well-defined  common  characteris- 
tic which  determines  membership  in  the  set.  Thus,  such 
things  as  all  American  families  in  a given  year  or  all  students 
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ever  graduated  from  American  colleges  would  constitute  statistical 
universes.  It  is,  however,  the  variab’e  properties  of  the  sampling 
units  with  which  the  scientist  is  primarily  concerned.  As  Mueller 
and  Schuesslei  put  it:  "We  are  seldom  . . . interested  in  families 
qua  families;  rather  our  interest  will  lie  in  one  or  more  of  their 
relevant  traits:  income,  nationality,  size,  social  status,  religion, 
and  so  on. " 

Defining  a universe  is  the  preliminary  step  in  the  sampling 
procedure.  Then  the  sample  must  be  drawn,  and  it  is  in  the  selec- 
tion of  sample  respondents  that  we  find  a basic  feature  of  social 
science  research.  There  are  two  kinds  of  sampling  procedures, 
random  and  non-random,  and  according  to  Mueller  and  frchuessler: 
"The  ideal  sampling  procedure  is  one  in  which  the  drawings  are 
affected  by  impartial  chance  factors  alone,  with  the  result  that  one 
item  in  the  universe  is  as  likely  to  be  included  in  the  sample  as 
another.  " This,  of  course,  is  referred  to  as  the  random  sample. 
Its  advantage  lies  in  the  fact  that  no  item  is  accorded  preferential 
advantage  of  being  selected,  thereby  avoiding  all  sorts  of  bias, 
the  chances  of  selecting  a representative  sample  are  maximized, 
and  it  becomes  possible  to  draw  a number  of  statistical  inferences. 
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Social  scientists,  however,  often  must  compromise  with  the 
limitations  of  time  and  money  art'  the  nature  of  the  research,  all 
of  which  can  make  it  virtually  impossible  to  draw  a true  random 
sample.  In  effect  what  this  amounts  to  is  researchers  making  use 
of  captive  audiences  such  as  boys  on  probation,  selecting  indi- 
viduals whose  last  names  begin  with  FA  because  "such  names 
v/ere  probably  randomly  scattered  in  the  various  ethnic  : . 

groups,  " or  taking  every  tenth  name  from  the  telephone  book. 

These  procedures  have  obvious  utility  for  the  researcher  and 
are  capable  of  providing  worthwhile  evidence,  so  lcng  as  the 
inferences  diawn  from  them  are  accompanied  by  a ,ull  explana- 
tion of  the  relationship  between  universe  and  sample. 

Frori  Jjhn  H.  Mueller  and  Karl  F.  Schuessler, 
Statistical  Reasoning  ir»  Sociology 
(Boston:  Houghton  Mifflin  Company,  1961)  pp.  337-357. 
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John  H.  Mueller  and  Karl  F.  Schuessler,  Statlstici*.! 
Reasoning  in  Sociology  (Poston:  Houghton  Mifflin 
Company,  1961),  pp  337-357. 


The  Sampling  Attitude 

Tin  Purpose  of  Sampling,  Thr  broad  aim  of  statistics  is  to  describe  and 
jsuvr* i'kirizc  mass  phenomena  like  births,  deaths,  and  income,  and  their 
interrelationships.  However,  it  is  often  necessary  or  practicable  to  base 
such  description  on  a fraction  of  the  total  aggregate,  and  sometimes  an 
exceedingly  small  one  at  that.  Such  au  expedient  may  be,  and  usually  is, 
quite  satisfactory.  It  is  astonishing  how  effective  a well-selected  frag- 
ment can  be:  a small  snippet  from  a bolt  of  cloth;  a few  drops  of  blood 
from  the  patient's  total  supply;  a few  thousand  survey  votes,  by  which  we 
describe  the  political  intentions  of  millions  of  voters.  Such  procedure 
h standard  practice  in  everyday  social  and  economic  life,  as  well  as  m the 
branches  of  scientific  activity.  In  instances  of  this  kind,  when  the  data 
are  partial  rather  than  complete,  and  when  they  are  used  to  characterize 
the  entire  set,  we  call  the  fi figment  a sample,  avid  the  total  aggregate  a 
universe,  o:  population.  We  name  a specified  value  of  the  universe,  such  as 
the  mean,  a parameter,  and  its  counterpart  in  the  sample  wc  term  a siutistic. 
The  objective  of  sampling  is,  therefore,  to  draw  an  inference  about  the 
parameter,  which  is  unknown,  from  the  sample  statistic  which  is  observed. 
This  process  of  generalizing  in  a prescribed  manner  from  sample  to  universe 
has  come  to  be  known  as  stalislical  inference. 

Although  statistical  inference  as  a formal  quantitative  technique  is 
principally  an  achievement  of  the  twentieth  century,  its  underlying  motiva- 
tion is  as  ancient  as  mankind  itself.  In  his  continuing  effort  to  adjust  to 
the  environment,  man  has  necessarily  been  obliged  to  treat  an  isolated 
experience  as  typical  of  a larger  system  in  order  to  profit  by  it.  Once  stung 
by  a bee,  or  burned  ou  a stove,  he  is  likely  to  view  such  objects  as  persisting 
hazards  to  be  avoided.  On  the  same  principle,  but  in  a more  deliberate 
manner,  he  samples  one  peach  from  the  bushel,  plugs  a watermelon,  or 
give*  the  new  car  a trial  run. 
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Moreover,  fresh  samples  of  experience  continually  stimulate  us  to  re- 
examine our  previous  inferences.  Thus,  the  success  of  one  woman  in 
Congress  will  east  doubt  on  the  previously  held  opinion  that  women  arc 
politically  i. '.competent;  the  grade  records  of  Negroes  in  northern  schools 
will  force  a revision  of  the  stereotyped  belief  that  Negroes  possess  an 
inferior  mentality.  Thus,' we  modify  our  premises  in  response  to  the 
unfolding  sample  evidence.  In  the  language  of  statistics,  wo  afford  our 
initial  hypotheses  an  opportunity  to  be  revised  or  nullified. 

While  every  man  thus  reasons  informally  from  part  to  whole,  statistical 
inference  is  much  more  rigorcus  than  such  mere  folk  practice.  Sampling 
has  come  to  be  a body  of  technical  procedures  which  must  be  deliberately 
applied  and  strictly  adhered  to  if  its  goals  are  to  be  fully  realized.  Thus, 
the  statistical  universe  must  be  well  defined,  and  the  sample  must  be 
properly  drawn  — two  basic  operations  which  are  much  more  involved 
than  appear?  on  the  surface.  In  fact,  the  sampling  system,  as  it  has  evolved 
in  the  twentieth  century,  is  essentially  a technical  accounting  device  to 
rationalize  the  procurement  of  information.  By  this  criterion  of  efficiency, 
it  is  always  preferred  to  complete  enumeration  when  it  yields  data  of 
requisite  accuracy,  since  such  a procedure  is  obviously  more  economical 
in  a busy  world  of  limited  resources. 

Further,  the  logic  of  sampling  is  reinforced  by  the  generally  accepted 
view  of  nature  as  being  orderly  and  predictable.  The  discipline  of  statistics 
has  not  always  enjoyed  such  a favorable  climate  of  opinion,  especially 
when  applied  to  human  behavior.  In  the  mid-nineteenth  century,  Qudtelet 
and  Buckle  were  denounced  as  materialists  because  they  drew  inferences 
from  statistical  records  on  the  recurrences  of  crimes,  suicides,  and  other 
human  actions  that  ran  counter  to  the  then  current  doctrine  of  free  will. 
I*  is  therefore  no  accident  that  the  sampling  method  has  been  cultivated 
most  intensively  and  that  the  sampling  mentality  has  flourished  in  the 
scientifically-minded,  efficiency  oriented  rational  culture  of  the  modern 
Western  world. 

^drautapes  of  Sampling . The  collection  of  sample  data  naturally  requires 
less  time  and  effort  than  does  the  compilation  of  complete  data.  lienee, 
surveyors  of  Amedcnn  public  opinion  universally  avail  themselves  of 
samples  of  respondents,  since  results  based  on  time-consuming  total  enu- 
merations would  be  obsolete  before  they  could  be  tabulated  and  published. 
Similarly,  since  10 10,  sampling  has  been  extensively  employed  by  the 
U.S.  Bureau  of  the  Census  in  its  decennial  enumerations  to  provide  more 
promptly  detailed  descriptions  of  various  characteristics  of  the  American 
population. 

Since  sampling  is  also  less  costly,  it  may  be  quite  feasible  when  the 
financial  burden  of  full  coverage  is  prohibitive.  Although  a free-lance 
sociologist,  for  example,  may  believe  it  desirable  to  interview  every  divorced 
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couple  in  a large  city,  he  may  find  such  a program  impracticable  to  execute 
because  of  limited  financial  resources.  He  will  then  be  content  with  a 
carefully  selected  sample,  BuJ.  this  is  not  an  unusual  restriction.  Research 
workers  in  general  expect  to  arrive  at  valid  generalizations  on  the  basis  of 
sample  materials,  presumed  to  be  representative  of  some  wider  domain. 
Thus,  the  anthropologist  interrogates  a few  native  informants  in  order  to 
reconstruct  the  entire  pattern  of  u culture;  the  L\S.  Children's  Bureau 
relics  on  a sample  of  juvenile  courts  in  order  to  establish  national  trends 
in  delinquency. 

Moreover,  there  are  times  when  sampling  is  no t,  only  practically  advan- 
tageous, but  an  almost  inescapable  necessity.  When  we  destroy  an  object 
in  the  very  act  of  measuring  one  of  its  characteristics,  we  must  either 
sample  only  a few  or  destroy  them  all.  Tins  goes  by  the  name  of  destructive 
sampling.  For  example,  it  is  impossible  to  measure  the  life  span  of  un 
electric  light  bulb  without  ending  its  life;  consequently,  to  approximate 
the  average  life  span  of  a large  lot,  the  manufacturer  necessarily  resorts 
to  a small  sample.  Analogously,  the  physician  cannot  afford  to  sample 
more  than  a lew  drops  of  blood  without  causing  undue  discomfort  in  hi9 
patient. 

But  destructive  sampling  may  sometimes  be  abandoned,  if  we  are  un- 
willing to  tolerate  the  destruction  of  even  a single  item.  Where  human 
life  is  jeopardized,  we  -r<re  not  willing  to  risk  the  loss  of  a single  sample  case. 
And  even  when  the  sampling  is  not  harmful,  social  considerations  inhibit 
us  from  freely  carrying  out  experimentation  on  human  beings.  It  is  only 
when  an-  occasional  life  convict  or  conscientious  objector  voluntarily  sub- 
mits to  a possibly  harmful  experimental  treatment  that  public  opinion 
permits  destructive  sampling  to  be  carried  out  on  human  beings.  In  the 
usual  case,  such  sampling  is  transferred  to  dogs  and  monkeys,  who  serve 
as  stand-ins  for  man. 

Sample  materials  are  thus  the  stuff  of  which  scientific  generalizations 
are  made,  but  the  sampling  method  involves  more  than  mere  recognition 
of  that  principle.  Modem  sampling  practice  is  distinguished  by  (1)  its 
emphasis  on  the  well-defined  universe,  (2)  the  random  selection  of  cases, 
and  (3)  the  estimate  of  the  reliability  of  the  sample  statistic  — that  is, 
how  closely  it  probably  conforms  to  the  unknown  parameter.  These 
concepts  form,  the  basis  of  the  discussions  in  this  and  the  following  chapter. 

The  Statistical  Universe  {Population) 

Definition  oj  Universe.  h\  colloquial  speech,  the  term  uninrsc  suggests 
the  entire  Creation.  Put  in  statistical  language,  it  refers  merely  to  a 
totality  of  values  possessed  by  elements  having  a well-defined  common 
characteristic  which  determines  membership  in  the  set.  Such  sets  are 
also  termed  statistical  populations.  We  may  cite  as  universes:  the  incomes 
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of  al!  American  families  in  1900,  the  opinions  of  all  college  students  on  the 
subject  of  war,  or  the  social-status  ratings  of  all  residents  in  Yankee  City. 
From  these  examples,  \vc  discern  that  the  statistical  universe  may  bo 
conveniently  conceived  of  as  being  dualistic  in  nature:  (a)  one  dimension 
consisting  of  the  units  (e.g.,  families)  which  are  actually  sampled  and  which 
are  therefore  called  sampling  units;  (b)  the  other  dimension  being  the 
sampling  trait  (c.g.,  income)  possessed  by  the  sampling  units,  which  is 
subsequently  manipulated  statistically. 

It  is  the  sampling  units  that  arc  physically  selected:  families  with 
incomes,  farms  with  acres,  students  with  opinions,  workers  with  occupa- 
tions. Further,  the  well-defined  criterion  which  determines  eligibility  for 
such  selection  here  applies  to  the  sampling  units  rather  than  to  the  traits. 
Thus,  a population  of  families  consists  of  all  human  groups  that  satisfy 
the  working  definition  of  a family. 

On  the  other  hand,  it  is  the  variable  properties  of  the  sampling  units 
that  command  our  ultimate  interest.  We  are  seldom  if  ever  interested 
in  families  families;  rather  our  interest  will  lie  in  one  or  more  of  their 
relevant  traits:  income,  nationality,  si?.e,  social  status,  religion,  and  so  on. 
It  is  variables  such  as  these  that  are  subjected  to  statistical  measurement 
after  the  sampling  units  have  been  drawn  and  their  characteristics  deter- 
mined. 

When,  for  any  reason,  it  is  unnecessary  to  identify  explicitly  both  aspects 
of  the  universe,  wc  may  quite  properly  employ  elliptical  statements  such 
as  "tho  population  of  college  students’1  or  “ the  universe  of  attitudes/’ 
But  such  abbreviated  statements  omit  the  sampling  trait  of  attitude  (of 
the  college  students)  in  the  first  instance,  and  the  sampling  unit,  the  college 
student  (whose  attitudes  are  polled),  in  the  second  instance.  Although 
terminology  varies  among  statisticians,  we  here  conceive  of  the  ‘ universe” 
as  comprising  both  the  sampling  unit  and  the  corresponding  sampling 
trait. 

Universe  Classification . Statistical  universes  differ  in  various  ways  and 
may  therefore  be  classified  according  to  various  criteria.  Three  of  the 
more  important  classifications  are  here  briefly  set  forth  to  amplify  the 
sampling  concept  and  refine  its  implementation: 

(1)  The  universe  will  be  qualitative  or  quantitative  according  to  whether 
the  traits  of  which  it  is  composed  are  attributes  or  variates.  This  familiar 
distinction  implies  that  the  statistical  description  of  the  universe  will  take 
the  form  either  of  arithmetic  averages  or  frequency  counts  and  percentages. 
For  example,  the  incomes  of  American  families  may  be  represented  by  the 
mean,  whereas  the  occupations  of  American  workers  can  only  be  grouped 
and  counted  and  expressed  as  percentages  of  the  total. 

(2)  The  universe  may  be  finite  or  infinite,  depending  on  whether  the 
sampling  units  are  finite  or  infinite  in  supply.  By  definition,  a finite  uin- 
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verse  contains  a countable  number  of  elements.  It  may  be  relatively 
small,  as,  for  example,  all  students  enrolled  in  a particular  college  in  a 
given  year;  or  it  may  be  relatively  large,  as  all  college  enrollees  in  tlie 
United  States.  Bui  the  infinite  universe  consists  of  an  endless  number  of 
elements,  such  as  an  unlimited  number  of  penny  tosses  or  other  experi- 
mental trials.  It  is  thus  purely  conceptual,  and  may  even  seem  meta- 
physical to  the  finite  mind.  Ami  yet  it  is  often  heuristically  postulated  in 
statistical  inference.  For  one  reason,  an  infinite  population  permits  the 
reliability  of  the  sample  findings  to  be  more  simply  evaluated  by  formula 
than  does  a finite  population.  Consequently,  we  resort  to  the  assumption 
of  an  infinite  population  whenever  the  size  of  a finite  population  is  large 
enough  to  justify  it.  Additionally,  it  may  be  invoked  when  it  is  not 
reasonable  to  limit  the  size  of  the  universe  at  all,  as  in  the  case  of  infinitely 
repeatable  experimental  trials.  The  laboratory  dog  may  die  after  an 
experimental  injection;  but  the  scientist  does  not  restrict  his  generaliza- 
tion to  the  dead  dog.  His  ultimate  interest-  lies  in  the  potentially  endless 
succession  of  experimental  trials  ideally  performed  under  identical  condi- 
tions. Onto  this  infinitely  large  universe  he  fastens  his  generalization. 
Analogously,  but  at  considerably  greater  risk,  the  social  scientist  may 
conceptually  extend  bis  findings  in  one  or  a few  cases  to  the  hypothetical 
infinite  class  of  presumably  identical  events.  One  family,  one  community, 
one  culture,  one  bureaucracy  — each  in  its  turn  serves  as  a prototype  to 
which  all  future  occurrences  of  the  same  general  class  arc  presumed  to 
conform. 

(3)  The  universe  which  is  actually  sampled  (the  sampled  universe)  null 
not  always  coincide  with  the  universe  on  which  our  sights  are  fixed  (the 
target  universe ).  The  target  universe  represents  ideally  the  territory  we 
intend  our  generalizations  to  cover — the  domain  to  which  we  eventually 
apply  our  sampling  knowledge.  Our  ultimate  interest  may  lie,  for  instance, 
in  the  patterns  of  adjustment  of  all  married  students  on  campuses  in  the 
United  States,  but  for  practical  reasons  it  may  be  necessary  to  restrict 
the  sampling  to  the  available  couples  on  a particular  campus,  and  these 
couples  then  become  the  sampled  universe.  When  a mailed  questionnaire 
is  returned  by  less  than  100  per  cent  of  the  sample,  as  is  usually  the  case, 
we  may  conceive  of  the  target  universe  as  the  complete  mailing  list  from 
which  the  sample  was  selected,  and  the  sampled  universe  as  all  persons 
on  that  list  who  would  theoretically  return  that  questionnaire  if  given  an 
opportunity  to  do  so.  Thus,  the  concept  target  universe  may  be  applied 
broadly  to  an  idealized  extension  of  the  sampled  universe,  or  narrowly  to  a 
universe  in  which  a fraction  of  the  units  are  for  oue  reason  or  another 
inaccessible  to  measurement,  such  as  refusals  or  not-at-homes. 

Strictly  speaking,  statistical  inference  should  be  rigorously  limited  to 
the  sampled  universe;  and  yet  the  social  analyst  can  hardly  refrain  from 
speculating  about  his  target  universe.  No  research  study  in  the  social 
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sciences  would  ever  bo  made  if  its  findings  could  not  be  imaginatively 
extrapolated  beyond  the  limited  universe  from  which  the  sample  has 
been  tiers ed.  Such  speculations  are  both  justifiable  and  desirable,  pro- 
vided that  their  tentative  nature  is  recognized  and  understood.  A sam- 
pling study  o?  mental  health  and  social  class  in  a New  England  community 
is  of  significance  primarily  for  the  light  that  it  sheds  on  the  relation  between 
social  structure  and  personality  deterioration  in  other  American  communi- 
ties. And  so  the  investigator  quite  naturally  probes  his  sample  materials 
for  their  wider  generality  and  makes  the  most  of  his  costly  data.  In  pro- 
jecting his  findings  onto  the  vaguely  defined  target  universe  he  necessarily 
proceeds  without  benefit  of  strict  reliability  procedures.  Yet  he  may  be, 
and  usually  is,  engaged  in  fruitful  and  necessary  scientific  activity.  Never- 
theless such  a libera!  statistical  morality,  which  is  practiced  by  even  the 
most  thoughtful  and  productive  social  scientists,  is  by  no  means  license 
for  irresponsible  and  sloppy  statistical  generalizations.  It  should  be  per- 
mitted only  to  experienced  investigators. 


Sampling  Procedures 

Problems  cf  Sampling*  The  process  of  sampling  is  in  its  scientific  sense 
a technical  operation  which  must  be  conducted  according  to  standard 
prescriptions  in  order  to  secure  all  of  its  benefits.  In  fact,  costly  social 
investigations  have  sometimes  been  severely  blemished  because  the  sam- 
pling tactics  were  crude  ancl  inadequate.  Because  of  the  admitted  diffi- 
culties in  sampling  human  populations,  the  discussion  of  the  theory  ancl 
practice  of  sampling  must  always  occupy  an  important  place  in  the  do- 
main of  social  statistics. 

Consider,  by  way  of  illustration,  the  task  of  sampling  a set  of  college 
students  — a common  assignment  for  majors  in  journalism  or  sociology 
— with  a view  to  generalizing  about  the  entire  student  body.  Dis- 
regarding momentarily  the  kind  of  data  sought  — attitude  toward  com- 
munism, number  of  dates  per  week  — how  may  we  obtain  a sample 
whicli  will  do  justice  to  the  student  body?  This  is  no  simple  task.  The 
sampler  might  give  way  to'  the  first  impulse  to  take  all  persons  enrolled 
in  elementary  sociology  — a rather  attractive  possibility  since  such  classes 
include  a wide  variety  of  students  and,  in  addition,  are  easily  reached 
and  manipulated  for  experimental  or  survey  purposes.  However,  on 
second  thought,  he  will  realize  that  college  courses  are  almost  sure  to 
exert  some  selective  influence  among  students.  Thus,  sociology  is  likely 
to  attract  persons  whose  primary  interest  is  in  social  issues,  but  may 
hold  no  appeal  for  those  principally  interested  in  the  physical  world, 
in  restricting  the  sampling  to  sociology  students,  there  is  therefore  a 
danger  of  excluding  certain  types  of  individuals  who  are  negatively 
selected  by  this  subject.  To  forestall  this  outcome,  we  might  propose 


John  H.  Mueller  and  Karl  F.  Schuebsler 


using  pc-rs-Oiis  enrolled  in  English  Composition,  as  this  subject  is  required 
of  all.  Again,  reasonable  enough  on  first  thought,  this  alternative  will 
seem  less  satisfactory  when  we  realise  that  only  mulcrclnssmcn  are  en- 
rolled; mature  upperclassmen  would  liave  no  opportunity  to  be  included 
in  the  sample. 

Many  other  possibilities  will  readily  suggest  themselves  in  the  search 
for  representative  coverage.  Thus,  we  might  consider  canvassing  the 
men's  dormitories.  But  this  scheme  rather  glaringly  omits  women,  and 
so  would  have  to  be  modified  io  include  women’s  residence  halls  as  well. 
Even  with  this  modification,  the  plan  is  rather  obviously  unsound:  it 
makes  no  provision  for  students  housed  in  fraternities  or  for  (hose  living 
at  home.  Apparently,  other  plans  will  have  to  be  tried  until- an  adequate 
one  is  developed. 

The  general  shortcoming  of  the  aforementioned  alternatives  wall  be 
recognized  by  even  the  casual  reader:  they  do  not  afford  each  sampling 
unit  an  equal  opportunity  of  being  selected.  Ideally,  of  course,  our 
sampling  procedure  should  give  to  each  unit  in  the  population  such  an 
opportunity.  Methods  wnieh  meet  that  criterion  are  named  random 
sampling  procedures;  methods  which  offer  no  such  assurance  may  be 
conveniently  labeled  nonrandom  procedures.  When  such  equal  oppor- 
tunity of  selection  is  not  provided  for  by  the  sampling  technique,  it  will 
generally  be  impossible  to  vouch  for  the  representativeness  (reliability) 
of  (he  sample,  and  consequently  it  will  be  impossible  to  generalize  con- 
fidently from  the  sample  back  to  the  universe.  The  fulfillment  of  the 
purpose  of  sampling  is  jeopardized. 

X on-random  procedures  are  therefore  seldom  considered  to  be  ideal; 
nevertheless,  they  are  often  justifiably  resorted  to  in  social  research 
because  of  practical  necessity.  Beginning  with  the  least  useful,  several 
are  presented  here  in  ascending  order  of  utility  so  that  the  student  will 
be  familiar  with  their  respective  merits  and  shortcomings. 

Non-random  Procedures 

Haphazard  Sampling.  The  acceptance  of  whatever  cases  one  fortui- 
tously happens  to  encounter,  without  any  consideration  whatsoever  for 
their  degree  of  representativeness,  may  be  termed  haphazard  sampling.* 
This  practice  is  exemplified  by  the  old-fashioned  straw  vote  in  which 
citizens  arc  accosted  in  the  street  to  ascertain  their  voting  intentions 
From  these  straw’s  in  the  wind,  the  election  “forecast"  is  made.  The 
obvious  objection  to  such  casual  procedure  is  that  the  man-in-the-street 
simply  is  not  representative  of  the  total  electorate.  Similarly,  in  a sur- 
vey of  student  opinion  on  the  propriety  of  final  examinations,  those 

* Synonymous  concepts  that  appear  in  statistical  writing  include  accidental  sampling 
and  convenience  sampling. 
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nightlmwks  whom  we  happen  to  find  breakfasting  in  the  coffee  shop  at 
11  a.m.  will  be  something  less  than  a fair  cross-section  of  the  entire  stu- 
dent body.  Such  crude  chunks  of  data  are  "samples”  only  in  the  loosest 
sense  of  that  term.  When  seriously  used,  they  constitute  an  unflattering 
reflection  on  the  sophistication  of  tho^c  who  report  to,  and  accept,  such 
data. 


Availability  Sampling,  Although  fully  aware  of  the  limitations  of  non- 
random  sampling,  the  experienced  social  scientist  will  sooner  or  later 
realize  that  some  form  of  it  i often  the  only  alternative  to  abandoning 
the  inquiry.  In  many  instances,  he  may  be  required  therefore  to  seize 
whatever  opportunity  is  available.  The  Kinsey  survey  of  sexual  behav- 
ior in  the  United  States  male  population  was  severely  criticized  because 
it  was  based  to  a large  extent  on  data  provided  by  solicited  subjects  who 
made  themselves  available.  It  was  plausibly  contended  that  person* 
who  volunteer  to  provide  information  about  their  sexual  behavior  arc 
likely  to  differ  significantly  from  poisons  who  decline  to  be  interviewed. 
Hut  the  investigators  considered  the  acceptance  of  volunteer  respondent* 
ns  a pragmatic  solution,  lather  than  the  preferred  procedure.  It  was 
their  reasonable  assumption  that  many  random  selectees  could  not  be 
induced  to  relate  their  sex  histories,  and  that  any  possibility  of  obtaining 
a completely  random  sample  was  precluded. 

For  similar  reasons,  other  types  of  sociological  studies  must  often  rely 
on  available  opportunities.  For  example,  it  may  be  impossible  for  the 
behavioral  scientist  to  sample  all  workers  in  a given  industry,  but  yet 
possible  to  observe  workers  in  a local  plant;  impossible  to  poll  a group 
selected  from  nil  school  children  in  the  slate,  but  possible  to  poll  those 
attending  various  local  schools.  Social  psychologists  are  likely  to  fmd 
little  humor  in  the  sarcastic  remark  that  their  broad  "universal  9 gener, di- 
lations arc  founded  on  experiments  on  college  sophomores.  Above  all, 
they  would  prefer  to  study  all  kind*  of  individuals  — old  as  well  as  young, 
non-college  a?  well  as  college  — in  formulating  the  laws  of  social  learn- 
ing. Hut  it  is  often  a question  of  sampling  the  available  students  or 
otherwise  abandoning  the  project.  Hence,  they  must  convert  a captive 
audience  of  students  into  a "valid”  sample.  Similarly,  research  on 
delinquency  is  almost  always  based  on  the  available  delinquents  — boys 
on  probation,  boys  in  court,  or  boy*  in  the  industrial  school.  Regardless 
of  how  desirable  A might  lx?  to  study  a group  selected  from  ah  delin- 
quents the  unapprihended  as  well  as  the  apprehended,  such  comprehen- 
sive sampling  is  for  obvious  reasons  out  of  the  question. 

It  would  therefore  be  pedantic  to  deny  the  uses  of  available  opportu- 
nities. even  though  they  do  not  yield  ideal  data.  Social  scientist*,  like 
everyone  el^o,  must  often  content  themselves  with  compromises.  Not- 
withstanding their  shortcomings  availability  simple*  do  yield  siguifi- 
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cant  information  and  insights.  For  example,  the  Burgess  and  Cultrcll 
sample  of  526  married  couples  produced  many  revealing  propositions  on 
(he  positive  Mid  negative  factors  in  marital  happiness;  Znaniecki's  re- 
nowned sample  of  letters  written  by  Polish  peasants  provided  suggestive 
hypotheses  on  basic  human  motives.  Moreover,  inexpensive  captive 
samples  — college  classes,  prison  inmates,  members  of  the  armed  forces 
— can  often  be  utilized  to  develop  rcsiareh  techniques  and  to  supply 
experience  in  applying  them.  No  one  would  seriously  protest  against 
studying  hospital  patients  because  they  dc  not  perfectly  represent  all 
diseased  persons;  few  would  abandon  the  study  of  prison  inmates  merely 
because  they  are  not  an  exact  replica  of  the  criminal  population.  Avail- 
ability samples  arc  quite  legitimate,  so  long  as  the  inferences  drawn 
from  them  are  accompanied  bv  reservations  which  are  made  necessary 
by  the  ill-defined  relation  between  universe  and  sample. 

Judgment  (Purposive)  Sampling.  When  the  composition  of  the  universe 
is  known  to  the  sampler,  and  when  a small  sample  of  only  one  or  a few 
items  is  required,  the  sampler  may  elect  to  lcly  on  his  mature,  sound 
judgment  in  choosing  the  sample.  Judgment  sampling  will  therefore  be 
most  effective  in  the  hands  uf  an  expert  who  knows  bis  population,  and 
who  can  readily  spot  (he  typical  ease*  Why  should  he  choose  by  lot, 
and  thereby  run  the  risk  of  drawing  an  extreme  case,  when  he  is  confi- 
dent that  it  is  possible  to  obtain  a representative  item  by  deliberate 
choice?  lie  legitimately  pits  his  personal  judgment  agrnst  the  opera- 
tion of  (lie  laws  of  probability.  The  Lyuds  did  not  leave  the  choice  of 
Middletown  to  blind  chance;  for,  if  they  had  done  so,  a highly  atypical 
community  might  have  presented  itself  for  study  and  defeated  the  very 
purpose  of  the  investigation.  These  sociologies  were  interested  in  a 
single  modal  community,  as  indicated  by  their  choree  of  pseudonym. 
Judgment  sampling  is  therefore  especially  well-suited  to  the  ense  study t 
in  which  many  aspects  of  a single  representative  ease  are  observed  and 
analyzed. 

While  judgment  sampling  is  considered  efficient  for  selecting  only  one 
or  two  items,  it  Ins  also  been  applied  on  occasion  to  procure  samples  of 
considerable  bulk.  The  Time  Magotine  sample  of  V.S.  college  graduates 
(1952),  consisting  of  individuals  whose  last  names  began  with  "Fa,”  "*as 
justified  by  the  surveyors  on  the  ground  that  such  names  w*er~  probably 
randomly  scattered  in  the  various  ethnic,  religious,  and  socio-economic 
groups  of  the  population.  And  since  a genuine  randomized  selection 
would  have  required  a much  more  cumbersome  sampling  apparatus, 
along  with  trained  hands  to  operate  il,  the  “Fa*'  type  of  sample  was 
more  practicable  and  hence  in  this  instance  the  prcfvircd  alternative. 

However,  the  seductive  plau^bilily  of  judgment  sampling  should  not 
be  permitted  to  conceal  its  hazards.  Even  the  expert’s  judgment  is  sub- 
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jcct  to  human  bins  and  mor.  And  even  when  the  universe  is  in  plain 
fright,  it  is  probable  that  the  observer  will  misjudge  to  some  extent  the 
representativeness  of  a sample.  To  demonstrate  such  bias,  the  English 
statistician  Yates  once  requested  12  persons  to  select  three  samples  of 
20  items  each  from  a collection  of  1,200  stones,  each  sample  to  represent 
as  accurately  as  possible  the  size  distribution  of  that  experimental  uni- 
verse. Although  the  observers  were  free  to  view  t lie  collection  at  their 
leisure,  still  thcic  was  a consistent  tendency  to  exaggerate  the  average 
sue  of  the  stones  and  to  minimize  their  variation.  In  short,  there  was 
a constant  error  in  judgment.  We  cannot  be  sure  that  analogous  errors 
did  not  bias  the  fTa”  sample,  because  of  the  practical  certainly  of 
human  bias,  judgment  sampling  must  therefore  lx?  applied  with  great 
caution.  Hut  whatever  the  limitations  of  non-random  samples  may  be, 
the  techniques  of  descriptive  statistics  must  obviously  bo  competently 
applied  in  order  to  assure  their  maximum  utility. 

Random  Sampling  Procedures 

Ihfuiition  of  Random  Sampling.  If  available  samples  arc  fallible,  and 
expert  judgi'.'iit  is  not  to  be  trusted,  then  what  are  the  factors  that 
should  determine  the  composition  of  the  sample?  The  answer  is:  chance 
factors.  However,  in  view  of  the  low  regard  in  which  chance  factors 
are  usually  held  as  guides  to  action,  it  is  surprising  that  \vc  should  so 
willingly  lay  aside  our  cumulated  knowledge  and  experience  and  go  to 
the  other  extreme,  permitting  blind  chance  to  determine  the  choice  of 
the  sample.  In  most  human  situations,  wc  wish  to  eliminate  chance, 
since  it  disturbs  our  predictions.  Nevertheless,  the  idcoi  sampling  pro- 
cedure is  one  in  which  the  drawings  are  affected  by  impartial  chance 
factors  alone,  with  the  result  that  one  item  in  the  universe  is  ns  likely 
to  be  included  in  the  sample  as  another.  No  item  is  accorded  a pref- 
erential advantage.  In  fact,  random  sampling  is  defined  as  a procedure 
that  provides  an  equal  opportunity  of  selection  to  each  unit  in  the  popu- 
lation. Wc  eliminate  personal  bias  by  elimination  of  the  person,  and 
thereby  permit  only  the  play  of  impersonal  chance  forces. 

There  is  nothing  esoteric  about  random  sampling;  most  persons  arc 
familiar  with  a few  homemade  routines  that  assure  an  impartial  random* 
iicd  selection.  The  thoughtful  hostess  wishes  every  guest  to  have  an 
equal  opportunity  to  wi<>  the  door  priio,  regardless  of  wealth  or  friend- 
ship; the  contest  is  to  be  pcifeclly  democratic.  .Accordingly,  everyone 
is  requested  to  write*  hi*  or  her  name  on  a blank  slip  and  drop  it  into  the 
hat  After  the  last  m rival,  the  dips  are  thoroughly  mixed,  and  one 
name  is  drawn  and  declared  the  winner.  Although  only  one  persm  can 
win  the  door  prlic,  wc  still  say  that  the  probability  of  winning  wn«  iden- 
tical for  all.  Among  ?*»  guests,  the  probability  of  winning  would  bo  l 
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in  25.  How  is  t his  to  he  inteipreud?  T!ic  laying}  would  ivply:  “Since 
i lie  human  sampler  cannot  influence  the  drawing,  each  guest  has  t lie 
same  chance.”  But  the  statistical  interpretation  lies  in  the  very  defini- 
tion of  probability,  which  applies  to  mass  phenomena  or  conceptually 
repeatable  trials.  In  (his  instance,  “same  probability19  refers  to  the 
expectation  that  each  name  would  be  drawn  an  equal  number  of  times 
if  identical  drawings  were  conducted  indefinitely.  Of  course,  an  empirical 
demonstration  of  innumerable  drawings  is  not  undertaken;  nevertheless, 
it  is  tacitly  assumed  that  the  a priori  expectation  of  equal  occurrences 
would  eventually  be  confirmed.  This  principle  is  intuitively  xocognizcd 
by  the  losing  guests  who  confidently  console  themselves  with  the  bromide, 
“Utter  luck  next  lime.99 

Of  course,  equal  probability  of  selection  cannot  be  assumed  if  some 
cases  arc  less  accessible  than  others  or  if  the  selection  mechanism  is  func- 
tioning imperfectly.  Either  of  these  related  contingencies  will  defeat 
the  aim  of  random  sampling  and  result  in  biased  sampling.  Thus,  in 
selecting  a sample  of  10  slips  from  a receptacle  containing  200,  random- 
nc*«  would  be  precluded  if  the  slips  were  unequal  in  size,  shape,  or  weight; 
or  if  they  were  carelessly  mixed  so  that  the  last  names  dropped  into  the 
hat  would  have  the  best  opportunity  of  being  drawn.  Hut  the  outward 
appearance  of  the  extracted  sample  would  never  reveal  such  procedural 
flaws,  or  biasing  factors.  The  composition  of  a non-random  sample  is 
visibly  no  different  from  that  of  a sample  of  the  same  size  selected  by 
random  procedure*;  the  sampling  operation  leaves  no  telltale  mark. 
Thus,  if  we  were  to  come  upon  two  different  scatters  of  ten  coins  each 
shewn  oil  two  tables,  the  one  all  heads  and  the  other  showing  six  heads 
and  four  tails,  no  amount  of  visual  inspection  would  tell  us  whether  either 
or  both  of  them  had  been  carefully  laid  down  or  whether  they  had  been 
tossed  at  random.  Nevertheless,  the  observer  will  intuitively  conclude 
that  the  first  Inal  probably  been  laid  down  and  the  second  had  been  arbi- 
trarily tossed.  Why?  Because  the  probability  of  10  heads  together  is 
so  small,  and  the  division  of  0 and  4 much  greater.  In  fact,  the  prob- 
abilities compare  as  1 to  210.  Still,  he  can  never  bo  certain.  Similarly, 
in  the  case  of  our  hostess,  the  name  on  the  winning  slip  can  never  betray 
whether  the  drawing  was  honest  or  biased. 

However,  a suspicion  of  biased  sampling  will  be  aroused  by  the  sub- 
sequent discovery  of  a inmkcd  discrepancy  between  the  sample  value 
mid  the  true  value  of  the  universe,  when  and  if  the  latter  becomes  known. 
The  now  classic  exarplc  of  such  a melancholy  outcome  is  the  notorious 
Lit  f rot  y Digest  Presidential  pre-election  poll  of  1030.  While  Roosevelt 
obtained  approximately  GO  per  cent  of  the  popular  vote  in  the  actual 
election,  his  percentage  in  the  Literary  Digest  sample  of  over  2,000,000 
respondents  was  only  40  ptt  cent  — a difference  of  20  |*ru-ntage  points! 
This  di-crepaney  in  such  a lai^e  sample  was  symptomatic  of  a gross 
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defect  in  flic  sampling  procedure,  which,  upon  later  review  end  analysis, 
statisticians  easily  established.  Although  such  a discrepancy  could  have 
occurred  by  chance,  it  would  have  been  extremely  unlikely.  Hence  the 
deduction  that  the  method  of  sampling  was  biased.  But  such  appraisals 
arc  always  retrospective.  In  advance  f the  sampling,  one  can  only  pro- 
vide for  adequate  machinery  which  will  be  reasonably  certain  to  yh id  a 
random  selection. 

Simple  Random  Scnitplwg.  If  called  upon  to  devise  a do-it-yourself 
sampling  technique,  the  inexperienced  layman,  like  our  hostess,  would 
probably  procure  as  many  slips  ns  there  arc  items  in  the  population  to 
be  sampled.  Slips  of  paper  are  more  easily  shuffled  than  people.  We 
may  summe  that  he  would  then  number  these  slips  consecutively  from 
1 through  A\  corresponding  to  the  numbered  units  of  the  universe.  Xext, 
he  would  place  the  slips  in  a suitable  receptacle  and  mix  them  until  the 
set  was  thoroug1ily  scrambled.  Finally,  he  would  reach  in  and  fake  out 
ns  many  cases  as  desired.  If  he  wished  a sample  of  10  cases,  he  would 
take  out  10  different  slips.  Statisticians  recognize  this  as  the  simplest 
type  of  random  sampling  and  have  therefore  dubbed  it  aimpfc  random 
la  mptitij. 

They  do  not  casually  describe  it  ns  " reaching  in  and  drawing  out  n 
different  items,”  Bather  they  define  it  as  that  procedure  in  which  every 
distinct  sample  of  n items  has  an  equal  probability  of  selection  from  a 
finite  population  of  X items.  This  definition  expresses  the  long  run 
consequence  of  <r reaching  in  and  taking  out  n different  items.”  For, 
if  that  procedure  were  applied  indefinitely  to  a given  population  (re- 
storing the  entire  sample  after  cacli  trial),  each  different  sample  would 
tend  to  reappear  an  equal  number  of  times.  In  its  procedural  aspect, 
simple  random  sampling  is  the  apparatus  that  guarantees  the  fulfillment 
of  this  criterion  of  equal  probability.  In  its  substantive  aspect,  it  is 
the  vc;y  criterion  itself. 

To  unfold  further  the  meaning  of  simple  random  sampling,  let  u*  con- 
sider the  number  c-f  ways  in  which  samples  of  2 items  can  be  selected 
from  a miniature  population  of  5,  whose  members  we  shall  designate: 
a,  b,  c,  d,  c.  By  manipulation,  we  discover  that  there  arc  10  different 
possible  combinations,  or  samples: 

ab  be  cd  de 

ac  I'd  ce 

ad  be 

ac 

Hence,  the  probability  of  each  combination  in  simple  random  sampling 
must  \ 3 1 in  10;  each  sample  is  expected  to  occur  on  the  average  once 
in  every  10  trials. 
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For  finite  universes  and  samples  of  any  size,  this  probability  may  be 
expressed  in  terms  of  the  now  familiar  combinatorial  formula: 

Pr  (Given  Sample)  » ^ 

Applying  these  formulas  to  the  illustrative  data  given  above,  we  have 
Cf  (Total  Possible  Samples)  * « 10 

Pr  (Given  Sample)  » yg 
which  agrees  with  the  previous  results. 

Sampling  by  Random  Digit*.  A rudimentary  technique  for  carrying  out 
simple  random  sampling  Ins  already  been  set  forth:  (1)  represent  units 
on  slips;  (2)  thoroughly  scramble;  and  (3)  draw  the  required  number 
of  slips.  However,  the  drawing  need  not  be  laboriously  carried  out  in 
this  manual  fashion.  It  will  usually  be  inefficic  it  to  do  so,  especially 
when  the  population  is  large.  It  is  i»  u«.’*  move  \ racticable  to  substitute 
a table  of  randomly  ordered  digits  for  shulllod  numbered  slips  — a 
common  procedure  of  research  workers.  This  technique  necessarily  re- 
quires that  we  number  serially  all  of  the  units  in  the  population  from  1 
through  A*  and  then  draw  from  (he  corresponding  table  of  random  digits, 
as  many  different  numbers  (combinations  of  digits)  as  there  arc  cases  to 
be  included  in  the  sample.  The  cases  whose  serial  numbers  correspond 
to  those  drawn  from  the  tabic  constitute  the  sample.  Tabic  IV  of  the 
Appendix  presents  such  a list  of  random  numbers. 

Oilnr  RanJont  &tMpliug  PrcuJura 

Simple  random  sampling  is  the  most  primitive  and  unrestricted  selec- 
tion procedure  and  most  clearly  exposes  the  essential  operation  of  ran- 
domness. Being  free  of  procedural  n ' lifications  which  arc  often  made 
necessary  by  practical  circuimf  nices,  tl  is  conceptually  the  simplest  of 
all  sampling  routines  and  is  therefore  so  lal'cled.  Since  it  is  random  sam- 
pling in  its  most  uncomplicated  form,  it  serves  as  a standard  of  sampling 
efficiency  against  which  other  lyfrts  are  compared  and  evaluated.  These 
other  types  are  made  necessary  by  the  fact  that  simple  random  sampling 
in  it?  pure  form  can  almost  never  be  employed  in  large-scale  social  re- 
search. It  is  far  too  impractical  and  costly,  and  often  even  impossible. 
Xeveitlicless,  it  is  essential  that  the  student  cleaily  comprehend  its  basic 
characteristics  in  order  to  1*0  able  to  rtcognire  and  appreciate  the  degree 
to  which  the  alternative  methods  depart  /ran  this  standard  model. 
Three  of  the  most  prevalent  alternative  types  are  here  set  forth  as  ft  brief 
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introduction  to  the  subject:  (1)  stratified,  (2)  cluster , and  (3)  interval  sam- 
pling. Those  sampling  devices  themselves  are  largely  an  outgrowth  of  the 
sheer  practical  problems  that  have  arisen  in  the  sample  surveys  of  largo 
human  aggregates,  which  accounts  for  their  wide  currency  in  social  science* 

Stratified  Sampling . “Do  the  citizens  of  Ilrownville  favor  racial  inte- 
gration of  schools? rt  No  competent  surveyor  of  public  opinion  would 
attempt  to  answer  that  question  without  canvassing  both  white  and  Negro 
residents  of  the  community.  Public  sentiment  on  this  issue  would  not 
be  reliably  portrayed  by  a sample  that  slighted  cither  group,  since  their 
opinions  are  so  divergent.  Vet  such  an  imbalance  between  groups  might 
occur  under  simple  random  sampling,  unless  the  sample  wove  made 
large  enough  to  forestall  that  eventuality.  A more  economical  alternative 
would  be  to  sample  each  subgroup  separately  and  combine  the  results, 
and  thereby  avoid  a costly  inflation  of  sample  size.  Such  an  operation, 
which  first  separates  the  entire  population  into  relevant  strata  before 
randomly  drawing  the  sample,  is  known  as  stratified  snmpfmy. 

From  a procedural  standpoint,  stratified  sampling  therefore  consists 
of  the  following  stages:  (1)  division  of  the  total  universe  inlo  subclasses, 
or  strata;  (2)  the  selection  of  a random  sample  from  each  stratum;  and 
(3)  the  consolidation  of  the  svbsatnple  statistics  into  a combined  statistic 
weighted  for  sire  of  strata,  in  this  context,  the  term  stratification  does 
not,  of  course,  connote  a hierarchy,  a«  the  ranks  of  an  army,  or  the  geo- 
logical scams  of  the  caith  crust;  rather  it  signifies  the  categories  of  a 
statistical  variable,  such  i \c  race,  sex,  or  religion,  into  which  the  total 
population  is  conveniently  divided. 

Nor  does  stratification  imply  a relaxation  of  the  requirement  of  random- 
ised selection,  although  that  inference  has  been  sometimes  mistakenly 
drawn.  This  misconception  may  possibly  reflect  a failure  to  distinguish 
clearly  between  stratified  inndom  sampling  and  so-called  S'irnpfm?, 
which  was  formerly  widely  used  in  opinion  polling.  In  quota  sampling, 
quotas  ate  pro-assigned  to  strata,  but  the  final  selection  of  eases  is  left 
to  the  discretion  of  the  interviewer.  However,  if  the  benefits  of  random 
sampling  are  to  be  attained,  subsamples  from  strata  must  be  randomly 
chosen.  The  resort  to  convenience  or  judgment  sampling  is  no  more 
warranted  within  a given  stratum  than  it  i<  in  the  whole,  un stratified 
population,  and  is  almost  certain  to  lead  to  biased  result-. 

Since  stratified  sampling  i-  more  complex  than  simple  random  sam- 
pling, we  may  rightly  ask  what  are  its  compensating  advantage?,  briefly 
put,  it  i<  ft  labor-saving  device  for  securing  equivalent  accuracy  with 
fewer  eases  than  is  likely  under  simple  random  sampling.  It  i«  essen- 
tially for  this  nnson  that  national  public  opinion  polls  universally  resort 
to  some  form  of  stratified  sampling  in  order  to  keep  the  site  of  the  sample 
down  to  manageable  [ roporliens.  Thus  suhsamplcs  ate  usually  *c loctM 
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from  each  of  the  broad  geographical  regions  of  the.  United  States,  it  being 
plausibly  assumed  that  public  opinion  on  many  issues  varies  from  one 
sector  to  another:  the  Midwest,  for  example,  is  internationally  more 
isolationist  than  the  Knsf.  Similarly,  subsamples  are  commonly  drawn 
from  various  age  and  economic  levels,  as  these  factors  are  also  known 
to  exert  an  influence  on  the  content  and  intensity  of  public  opinion.  Older 
persons  arc  generally  more  conservative  than  younger;  political  opinions 
(end  to  parallel  economic  interests.  On  the  other  hand,  national  opinion 
polls  would  never  stratify  the  population  by  hair  color,  since  there  U 
probably  no  genuine  correlation  between  hair  color  and  political  opinion. 
In  general,  stratification  serves  no  purpose  when  the  M ratifying  factor 
is  unconelat^d  with  the  sampling  trait  being  measured. 

Comparison  of  Strata,  While  the  principal  justification  of  stratified 
sampling  is  equal  accuracy  with  a smaller  sample,  the  comparative  data 
which  are  its  natural  by-product  provide  an  additional  inducement  for 
using  it.  Tims,  the  decision  to  stratify  by  occupation  in  a survey  of 
opinion  on  labor  unions  may  be  prompted  as  much  by  the  wish  to  com* 
pare  the  characteristics  of  the  various  occupational  classes  as  by  the  need 
to  economize  on  sample  size.  The  differences  among  occupational  cate- 
gories (Table  11.1)  on  the  question  "Aie  you  in  favor  of  labor  unions?” 


•tffifi'ck  Toward  Labor  Unions  by  Qccu* 
Talk  11,1  ptr/foN,  P(re<nta$c  Distribution,  C.S . 
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may,  in  fact,  be  even  mow  pertinent  and  revealing  than  the  over-all 
weighted  average  of  07  per  cent,  which  necessarily  conceals  such  differ- 
ences. Since  stra1 ' arc  often  treated  individually,  it  ha*  been  suggested 
that  the  term  "domains  of  study  ’’  be  applied  to  strata  when  they  arc 
being  analyzed  in  ibis  segregated  manner. 
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Chtstcr  Sampling.  In  the  on?.'  of  the  drawing  for  the  door  prize,  it  was 
obviously  necessary  to  have  a complete  list  of  eligible  names,  from  which 
the  winning  one  was  randomly  drawn.  For,  “equal  opportunity  to  be 
included”  implies  not  only  a mechanically  reliable  selection  procedure, 
but  also  a complete  list,  so ‘that  every  name  is  accessible  to  the  draw. 
It  \ould  similarly  be  necessary  to  have  a complete  source  list  of  sampling 
units  if  a simple  random  sample  were  drawn  from  any  other  statistical 
universe,  such  as  the  inhabitants  of  a community,  an  army,  or  a univer- 
sity student  holy.  Hut  such  social  universes  differ  fiom  the  more  or 
less  trivially  small,  easily  manipulated  guest  list  of  a banquet,  When 
the  universe  is  vast,  and  extends  over  a wide  area,  the  compilation  of 
siicli  nn  indispensable  list  is  a laborious  undertaking;  and  even  when 
available  (e.g.,  a city  directory),  it  is  usually  not  up-to-date  on  the  date 
of  issue  because  of  the  mutability  of  the  population. 

If,  therefore,  an  adequate  list  of  elementary  sampling  units  is  not 
available,  we  may  turn  instead  to  more  or  less  permanent  groupings 
into  which  the  population  is  naturally  divided  and  which  can  be  conven- 
iently listed.  Human  beings  are,  of  course,  usually  found  in  prevailingly 
standard  groups:  they  are  cindered  geographically  by  states,  counties, 
municipalities,  neighborhoods,  blocks,  precincts,  and  dwelling  units; 
people  work  together  in  factories,  offices,  ami  stores;  they  are  organized 
in  innumerable  clubs,  lodges,  schools,  and  miscellaneous  associations. 
It  is  these  groups,  or  clusttrs,  which  may  be  serially  utilized  as  sampling 
units,  through  which  we  reach  the  ultimate  elementary  unit  (e.g.,  the 
person  or  household)  which  is  the  objective  of  our  survey.  Such  a pro- 
cedure is  therefore  termed  cfusfrr  sampling.  We  thus  finally  reach  the 
elementary  unit  through  a shorter  or  longer  chain  of  samplings  of  the  more 
easily  listed  dusters.  Since  sampling  is  carried  out  in  successive  stages 
before  reaching  our  destination,  this  lyj>c  of  sampling  is  oho  called  wirdfi* 
stage  tOMpting.  When  the  clusters  at  any  stage  consist  of  tcrritoiaal 
units,  we  may  dcscril*  that  stage  as  orco  sampling. 

Let  us  suppose  that  we  wish  to  survey  the  occupational  ambitions  of 
high  school  pupils,  ages  15  17,  in  Chicago.  It  would  l>e  exceedingly 
laborious  and  financially  prohibitive,  as  well  as  hazardous  in  accuracy, 
to  compile  a lift  of  all  specified  pupils  in  that  metropolis.  However,  a 
permanent  list  of  high  schools  is  easy  to  obtain.  These  schools  could  too 
randomly  sampled,  ami  n list  of  students  in  the  desired  age  categories 
would  then  be  obtained  from  the  comparatively  few  high  schools  in  the 
sample.  From  that  lid,  the  sample  of  students  would  be  finally  drawn. 
Similarly,  if  the  households  of  the  city  were  to  bo  surveyed,  a sample  of 
city  blocks  then  of  dwelling  units  and  finally  of  households  would  bo 
drawn.  Fven  though  such  a multi-stage  procedure  would  still  require  i\ 
source  lid  at  each  level,  the  lids  would  be  smaller  and  more  current.  In 
such  economy  lie*  the  first  advantage  of  the  cluster  approach. 
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However,  the  execution  of  any  social  survey  is  not  fulfilled  by  the  mere 
drawing  of  the  sampling  unit?.  We  still  must  make  the  personal  contact 
with  the  selected  dementajy  units  to  obtain  (he  information  which  was 
the  objective  of  the  study  in  the  fu>t  place.  This  «roblcm  of  personal 
contact  develop?  into  a major  obstacle  in  simple  random  sampling,  which 
may  yield  a sample  whose  elements  are  widely  dispersed  over  a wide  area, 
requiting  a prohibitive  expenditure  of  time  and  energy  to  reach.  It  is 
in  such  a plan  fur  personal  interviews  that  cluster  sampling  displays  an- 
other technical  advantage.  Under  (his  procedure,  the  elementary  unit! 
are  territorially  concentrated,  and  are  therefore  more  ea?Py  accessible, 
v ilh  less  wastage  in  transportation.  Thi*  is  shown  in  Figure  11.1. 


Simple  Random  Sample  Chalet  Sample 

Fid  Hr  1 1.1  Schematic  Diagram,  Simple  Random  and  Clutter 
Sampling 


Hut  the  physical  and  mechanical  convenience  of  cluster  sampling  is 
purchased  at  a price  in  quality.  It  gone » ally  lacks,  to  a degree,  the  very 
characteristic  which  is  the  objective  of  good  sampling:  typicality  and 
representativeness.  Its  reduced  representativeness  is  due  to  the  fact 
that  the  deniCi.lary  sampling  units  within  clusters,  particularly  human 
dusters,  me  likely  to  lx?  closely  similar  in  regard  to  their  social  character- 
istics; consequently,  sampling  within  these  clusters  understates  the 
dispcisioii  and  provides  unnecessary  duplication.  For  example,  the  resi- 
dents of  a given  city  Mock  arc  likely  to  belong  to  the  same  socio-economic 
class,  and  therefore  hold  the  same  political  and  social  opinions.  By  the 
fit  me  token,  they  arc  not  likely  to  lx?  representative  of  any  diversity  of 
public  opinion  in  the  large  aggregate  of  unsampled  neighborhoods;  a 
cluster  sample  is  thus  lift  likely  to  represent  the  variety  of  opinions  than 
A simple  random  sample  of  the  same  size  scattered  over  the  community. 
H is  this  lack  of  heterogeneity  within  dusters  which  reduces  the  com- 
parative effectiveness  of  duster  samples. 

This  liability  notwithstanding,  cluster  sampling  is  being  increasingly 
employed  in  social  research.  For,  after  all,  statistical  work  — like  every 
other  human  endeavor  — is  a compromise  between  the  ideal  and  the 
practical,  and  in  any  case  can  attain  only  an  approximation  of  11  truth.” 
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Interval  (Systematic)  Sampling.  Whenever  the  sampling  nails  are  ar- 
ranged in  some  kind  of  natural  sequence,  like  consecutive  admissions  to 
a hospital  or  library  books  in  the  card  catalogue,  it  r ay  be  economical 
and  even  preferable  to  obtain  a sample  by  taking  eases  at  a fixed  interval. 
Such  a procedure  is  termed  interval  sampling  > or  more  commonly  but 
less  aptly,  systematic  sampling.  The  selection  of  every  tenth  name  from 
the  telephone  directory,  after  a random  start  among  the  first  ten  names, 
illustrates  the  process  of  interval  sampling.  Such  a procedure  has  obvious 
utility  for  the  social  scientist  who  frequently  has  occasion  to  study  a 
series  of  events  such  a<  a file  of  newspapers,  the  characteristics  Of  dwell- 
ing units  in  a given  ecological  area,  the  cases  on  docket  in  a criminal 
court,  or  a card  catalogue  of  welfare  case  records. 

To  establish  the  width  of  the  sampling  interval  (/;)  in  any  given  prob- 
lem, we  merely  find  the  ratio  of  population  size  (.V)  to  desired  sample 
sire  («): 


n 


Thus,  if  the  sample  is  to  contain  5 per  cent  of  the  universe,  or  1 out  of 
every  20  cases,  the  sampling  iutcivnl  obviously  would  l>c  20;  and  wo 
would  d:aw  every  20th  item,  randomly  starting  with  any  number  within 
the  first  interval  of  20.  Such  calculation  presupposes,  of  course,  that 
sample  sire  has  been  fixed  in  advance  of  the  sampling.  Hut  if  n has  nol 
been  set,  and  an  arbitrary  interval  is  employed,  it  is  still  necessary  to 
pass  through  the  entire  sequence,  even  though  we  may  seem  to  have  an 
ample  number  of  items  after  wo  have  proceeded  only  part  of  t lie  way. 
If  \vc  di-con  tinned  the  drawings  lecture  completing  the  entire  circuit,  we 
would  deprive  the  units  in  the  omitted  segment  of  the  opportunity  of 
being  chosen  and  thereby  destroy  the  randomness  of  the  operation. 
For  example,  by  skipping  names  b to  £ in  an  alphabetical  lifting,  wc 
would  almost  certainly  produce  a biased  sample. 

Since  each  interval  contributes  one  and  only  one  item  to  each  sample. 
It  follows  that  there  can  be  no  more  different  samples  than  there  a tv 
items  within  the  interval.  Thus,  if  die  sampling  interval  i<  equal  to  10. 
there  can  be  only  10  possible  samples,  regardless  of  (he  sire  of  the  uni- 
verse, be  it  1,000  or  1 ,000.000.  But  the  number  of  different  sample* 
would  be  almost  incalculable  in  simple  unrestricted  random  sampling. 
This  restriction  in  the  number  of  possible  samples  serves  to  dUtinguidi 
interval  sampling  from  simple  random  sampling,  since  the  litter  furnishes 
an  equal  opportunity  of  selection  to  every  di-tincf  combinatorial  sample 
of  n items. 

In  spite  of  tbi*  severe  limitation,  interval  sampling  will  often  produce 
results  that  compare  favorably  in  representativeness  to  tho*c  yielded  bv 
simple  random  sampling.  In  particular,  whenever  the  values  may  bo 
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presumed  to  he  rui  Jumly  oideral,  simple  random  rind  interval  stapling 
will  yield  identically  accurate  results.  Such  u presumption  is  often 
reasonable  when  items  have  been  alphabetically  lifted,  since  there  is 
usually  no  correlation  between  the  alphabetical  order  of  names  and  the 
traits  which  the  named  objects  possess.  Thus,  we  have  previously  seen 
that  the  alphabetical  listing  of  large  American  cities  orders  (he  respective 
suicide  rales  in  a sequence  which  is  seemingly  purely  random.  Simi- 
larly, an  alphabetical  listing  of  students  may  be  expect'd  to  result  in  a 
sequence  of  grade  averages  that  is  wholly  random  and,  therefore,  free  of 
(rends  and  cycles.  In  such  cases,  it  makes  no  ultimate  difference  whether 
\vc  select  random  digits  or  draw  every  /;th  unit,  except  that  interval 
sampling  is  usually  more  simple  to  execute.  The  long-run  results  would 
be  virtually  identical  for  any  given  size  sample. 

There  is,  however,  one  notable  circumstance  that  constitutes  a special 
hazard  for  interval  sampling  and  may  easily  lead  to  erroneous  conclu- 
sions. When  (he  universe  values  form  a eyeffeu?  prcgrc$*iO'tt  the  sampling 
interval  may  coincide  with  the  phase  of  the  cycle,  causing  interval  sam- 
pling to  yield  an  unrepresentative  set  of  identical  values.  Let  us  consider 
a fictitious  sequence  whose  phase  is  four;  1,  2,  3.  2;  l,  2,  3,  2;  1,  2,  3,  2. 
Now,  if  the  sampling  interval  is  set  equal  to  I,  any  sample  will  necessarily 
cousbt  of  a set  of  identical  values.  It  will  consist  of  all  l1?,  all  2 V,  or  all 
3's.  Such  samples  w ill  in  no  way  do  justice  to  the  variation  in  the  universe. 

This  type  of  pitfall  is  illustrated  in  a sampling  study  of  June  issues  of 
the  Sunday  AW  IVr/;  77mrs,  1032-1912,  which  disclosed  that  only 
Protestant  marriages  were  featured  on  the  society  page  of  the  sampled 
issues.*  From  this  finding,  the  conclusion  was  drawn  that  the  upper- 
upj'cr  social  class  of  New  York  City  was  preponderantly  Protestant  in 
icligions  background.  But  this  inference  was  immediately  challenged  on 
the  ground  that  Jewish  marriages  happen  for  ceremonial  reasons  not  to 
l>o  performed  in  June,  and  therefore  had  no  opportunity  to  appear  in 
the  issues  of  the  Time*  which  were  sampled.  A check  sampling,  tin- 
disturbed  by  daily  and  monthly  cycles,  revealed  that  Jewish  marriages 
were  in  fact  vrgulatly  featured  by  the  society  editors  during  the  appropri- 
ate seasons.  By  that  criterion,  Jews  were  proportionately  represented 
in  the  upper  social  strata.  In  this  instance,  the  sample  interval  kd  to  an 
ovcm 'pie?ontation  of  Frotoslants,  an  error  compounded  by  the  un- 
fortunate judgmental  selection  of  June  ns  a point  of  origin. 

But  interval  sampling  also  carrier  Us  intrinsic  advantage*.  In  fact, 
when  (lie  numerical  values  form  an  ariYAmriic  ptograticn,  interval  sam- 
pling will  bo  even  mono  ctfee ti\e  than  simple  random  sampling.  For,  in 

* Divkl  and  Mary  Hatch,  '‘Criteria  of  Social  Sta<u«  ft?  Derived  hot*  Marriage  An- 
notirrtfrrrtrit?  in  the  .Yor  Port  Aremet*  $<K&Cfkt4  /fcior,  XI  lt  1917,  pp.  306* 

403;  and  \V.  J.  Cahmrun,  'A  Note  on  Marriage  A rmotioc*  merits  in  the  Xnt 
Tint*"  Jwcrreea  S&hXcpetl  fierier,  XIH,  101$,  pp.  96*V7. 
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that  event,  the  sample  will  necessarily  distribute  itself  evenly  over  the 
entire  range  of  values  and  thereby  provide  a reliable  miniature  of  the 
population  distribution.  If,  for  example,  we  select  every  fifth  boy  from 
a lineup  according  to  height,*  the  resulting  sample  will  necessarily  be  repre- 
sentative of  the  distribution  of  boys'  heights.  Analogously,  if  we  sample 
cvcrjr  25th  dwelling  unit  along  a metropolitan  avenue  after  a random  start, 
wc  would  probably  obtain  an  accurate  cross-st  lion  of  the  various  socio- 
economic groupings  on  that  street,  since  dwelling  units  are  segregated  and 
ordered  according  to  social  status.  In  this  way  interval  sampling  may 
supply  its  own  stratification. 

From  this,  it  is  evident  that  the  principal  advantage  of  interval  sampling 
lies  in  the  mechanical  case  with  which  it  can  be  applied  to  such  natural 
sequences  as  rows  of  dwelling  units,  card  files,  city  directories,  and  so  on. 
Its  special  hazard  is  the  cyclical  sequence,  and  wc  must  maneuver  to  cir- 
cumvent it  when  that  danger  is  thought  to  exist. 

Interrelatedness  oj  Random  Sampling  Procedures.  Quite  obviously,  the 
foregoing  random  procedures  arc  not  mutually  exclusive.  They  may  be 
— and  usually  are  — combined  in  a variety  of  ways.  Cluster  sampling 
may  be  used  within  broad  strata,  and  interval  sampling  may  be  used 
within  these  clusters.  No  single  sample  design  is  best  for  any  given  pm- 
pose.  In  all  sampling,  an  attempt  is  made  to  attain  the  desired  degree 
of  representativeness  as  economically  as  possible,  which  is  the  guiding 
criterion  of  modern  sampling  design.  Wc  should  recognize  that  the 
designing  of  a sample  is  a form  of  statistical  engineering  and  accounting, 
requiring  appropriate  skills  and  knowledge.  In  the  foregoing  statement, 
we  have  merely  hinted  at  the  technical  aspects  of  sampling,  which  are  of 
course  fully  developed  in  treatises  on  that  subject. 

However,  effective  sampling  requires  much  more  than  mere  technique. 
If  the  assets  of  a given  sampling  procedure  are  to  be  fully  realized,  it  is 
essential  that  all  necessary  discretionary  as  well  as  mechanical  steps  be 
expertly  performed.  Thus,  the  anticipated  benefits  of  stratified  sampling 
will  not  be  attained  unless  the  strata  into  which  the  population  was  judg- 
mentally  divided  before  the  sample  was  drawn  actually  differed  among 
themselves  on  the  sampled  trait.  In  the  aforementioned  instance,  stratifi- 
cation by  race  would  be  profitless  if  Negroes  and  whites  shared  the  same 
opinions  on  school  integration.  If  on  the  other  hand,  Negroes  and  whites 
differed  widely,  then,  statistically  speaking,  there  would  have  been  con- 
siderable variation  between  strata,  but  relatively  little  within  each  stratum, 
thereby  validating  the  original  decision  to  stratify  by  race.  It  is  more 
efficient  in  terms  of  sample  size  to  sample  from  two  homogeneous  strata 
than  from  one  very  mixed,  heterogeneous  stratum.  The  knowledgeable 
worker  must  anticipate  the  validity  of  t lie  stratifying  criterion  before  the 
sampling  begins. 
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Analogously,  the  < fleet iver.cr*  of  cluster  sampling  will  be  enhanced  by 
exjic-i tly  composing  and  recomposing  dusters  in  advance  of  the  sampling 
so  that  each  duster  is  as  representative  of  the  entire  population  a?  possible. 
Insofar  as  that  effort  can  achieve  success,  given  the  heterogeneity  of  the 
grand  universe  itself,  the  respective  clusters  will  tend  to  resemble  one 
another!  while  there  will  be  considerable  statistical  variation  within  each 
cluster.  Thus,  hi  cluster  sampling  we  invert  the  specifications  of  stratified 
sampling:  instead  of  homogenizing  strata  we  diversify  the  elements  within 
clusters.  \Yc  may,  for  exar.  pie,  combine  precincts  into  larger  geographic 
districts  in  order  to  increase  the  diversification  within  clusters  and  thereby 
raise  their  representativeness  of  the  entire  electorate  and  fulfill  their  func- 
tion as  samples. 

In  Him,  no  sampling  technique  is  completely  automatic;  all  involve 
subject-matter  decisions.  Ilencc,  first-hand  practical  experience  with  the 
concrete  subject  matter  contributes  quite  as  much  to  fulfillment  of  a 
sampling  project  as  dexterity  in  the  mechanical  routines  of  applied  sta- 
tistics. 

Questions  and  Prohlems 

1.  Define  the  following  concepts: 

Universe 

Sample 

Infinite  Population 
Finite  Population 
Sampled  Population 
Target  Population 
Availability  Sample 
Haphazard  Sample 
Judgment  Sample 
Random  Sample 
Simple  Random  Sample 
Stratified  Sample 
Cluster  Sample 
Interval  Sample 

2.  State  difficulties  that  might  be  encountered  in  defining  the  populations: 

Families 
Dwelling  Units 
Farms 
Households 
City  Blocks 
Broken  Homes 
Overcrowded  Homes 
University  Students 
Gainfully  Employed 
Social  Class 


SURVEYS  AND  EVIDENCE 


The  following  article  illustrates  the  difficulties  which  lawyers 
interested  in  social  science  evidence  will  hav'!  to  confront.  Professor 
Zeisel  tells  us  that  there  are  two  major  problems  related  to  using 
survey  evidence  in  judicial  proceedings.  "The  first  problem  raises 
the  issue  of  hearsay.  The  second  raises  the  question  as  to  how  inter- 
viewing a few  hundred  people  . . . can  produce  correct  conclusions 
concerning  many  hundreds  of  thousands  of  people.  " The  second 
problem  is  essentially  the  same  one  addressed  by  Mueller  and 
Schuessler  in  the  preceding  reading.  As  both  they  and  Zeisel  point 
out,  the  reliability  of  survey  evidence  is  dependent  upon  how  well 
the  universe  has  been  defined,  the  means  by  which  the  sample  has 
been  drawn,  and  the  techniques  used  to  carry  out  the  actual  inter- 
views. These  procedures  are  important  simply  because  "courts 
generally  have  refused  admittance  to  surveys  in  which  they  have 
discovered  technical  flaws.  " 

But  it  is  Professor  Zeisel's  discussion  of  the  rule  against 
hearsay  evidence  and  how  it  relates  to  surveys  that  we  feel  is  the 
most  important  part  of  the  reading.  The  selection  does  not  offer 
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a guideline  for  practitioners  interested  in  having  surveys  admitted 
into  evidence,  but  it  does  provide  us  with  a clear  analysis  of  the 
legal  profession's  hesitancy  in  using  techniques  and  procedures 
foreign  to  the  traditional  methods  of  resolving  legal  conflicts. 


From  Hans  Zeisel,  "The  Uniqueness  of  Survey  Research," 
Cornell  Law  Quarterly , 45  (1960)  pp,  322-346, 
c Copyright  1960  By  Cornell  University. 

Reprinted  by  permission. 
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H4ns  Zelsel,  “The  Uniqueness  of  S Ukrvfl?  ^search, “ 

45  Cornell  Law  Quarterly,  pp  322-346  (1960). 

“Just  when  a scientific  principle  or  discovery  crosses  the  line  between 
the  experimental  and  demonstrable  stages  is  difficult  to  define.  Somewhere 
in  this  twilight  zone  the  evidential  force  of  the  principle  must  be  recog- 
nized . . . 

Barely  a decade  ago,  in  a trade-mark  contusion  case  involving  girdles 
for  young  ladies,* 1  Judge  Jerome  Frank  playfully  supported  his  dissenting 
opinion  by  a survey  which  he  himself  called  "not  satisfactory.”  In  fact, 
it  was  probably  the  worst  survey  ever  made.  He  simply  solicited  his 
daughter’s  and  her  girl  friends’  opinions. 

Nevertheless,  Judge  Frank  had  a point.  Sadly  remarking  that  “neither 
the  trial  judge  nor  any  member  of  this  court  is  (or  resembles)  a teen-age 
girl,”  he  felt  that  where  the  opinions  of  teen-agers  are  at  issue  nothing 
but  “information  directly  obtained”  from  them  would  suffice.  Regretting 
that  he  had  no  "staff  of  investigators  like  those  supplied  to  administrative 
agencies”  he  proceeded  on  the  sound  principle  that  any  relevant  informa- 
tion is  better  than  none.  j 

By  now,  the  science  of  making  surveys  has  come  into  its  own,  and 
with  increasing  frequency  the  courts  are  called  upon  to  decide  ques- 
tions relating  to  the  admissibility  and  use  of  survey  evidence.  Admin- , 
istrative  agencies,  not  bound  by  the  technical  rules  of  evidence,  will  ; 
usually  admit  a survey  and  then  evaluate  its  probative  power.2 3  In  court 
trials,  especially  jury  trials,  the  issue  of  a survey’s  worth  will  first  arise 
when  admissability  is  decided  on.  But  whatever  the  formal  occasion,  the  j 
evidential  value  of  surveys  is  at  issue.  This  paper  is  an  effort  to  con-  1 
tribute  towards  a better  understanding  of  survey  evidence,  its  value  and  j 
its  limitations,  within  the  legal  framework  which  the  courts  have  begun  ' 
to  develop.®  j 


t See  Contributors’  Section.  Masthead,  p.  347,  for  biographical  data. 

* Frye  v.  United  States,  293  Fed.  1013,  1014  (D.C.  Cir.  1923). 

1 Triangle  Publications  v.  Rohrlich,  167  F.2d  959,  976  (2d  Cir.  1948). 

2 ...  It  has  long  been  settled  that  the  technical  rules  for  the  exclusion  of  evidence 
applicable  in  jury  trials  do  not  apply  to  proceedings  before  federal  administrative 
agencies  in  the  absence  of  a statutory  requirement  that  such  rules  are  to  be  observed. 

Opp  Cotton  Mills  v.  Administrator,  312  U.S.  126,  155  (1941), 

3 A review  of  the  law  and  literature  through  1953  will  be  found  in  Lester  E.  Waterbury, 
•'Opinion  Surveys  in  Civil  Litigation,”  a paper  delivered  at  the  1952  meeting  of  the  American 
Association  for  Public  Opinion  Research,  17  Public  Opinion  Quarterly  71  (1953);  a revised 
version  in  44  Trade  Mark  Rep.  343  (1954)  ; iVTote,  66  Harv.  L.  Rev.  493  (1953).  A more 
recent  and  more  detailed,  but  less  analytical,  inventory  can  be  found  in  Barksdale,  The 
Use  of  Survey  Research  Findings  as  Legal  Evidence  (1957).  A review  of  the  potentialities 
and  shortcomings  of  public  opinion  research,  important  for  both  the  lawyer  ar.d  the  survey 
practitioner,  will  be  found  in  Blum  and  Kafven,  "The  Art  of  Opinion  Research:  A Lawyer’s 
Appraisal  of  an  Emerging  Science,”  24  U.  Chi.  L.  Rev.  1 (1956).  Appended  (at  64-65) 
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The  unique  value  of  a survey  consists  in  its  being  the  only  means  of 
measuring  the  characteristics  of  a group  where  such  a measure  is  at  issue, 
e.g.,  the  proportion  of  people  who  confuse  two  trade-marks,  the  share  of 
a market  held  by  a competitor,  the  level  of  wages  paid  in  an  industry, 
the  amount  of  timber  burned  in  a forest  fire,  the  proportion  of  items 
manufactured  below  specific  standards. 

The  two  survey  aspects  which  raise  major  legal  issues  are:  (1)  whether 
the  information  is  gathered  through  verbal  statements  made  by  the 
survey  interviewees,  or  only  through  observation,  counting  or  measuring 
by  the  field  worker,  and  (2)  whether  the  survey  is  based  on  a census 
enumeration  involving  all  members  of  the  group,  or  merely  on  a sample 
taken  from  that  group. 

The  first  problem  raises  the  issue  of  hearsay.  The  second  raises  the 
question  as  to  how  interviewing  a few  hundred  people,  or  examining  a 
few  specimens,  can  produce  correct  conclusions  concerning  many  hun- 
dred thousands  of  people  or  a whole  shipload  of  nwchandise. 

Accordingly,  for  our  purpose  surveys  may  classified  into  four 
groups:  (1)  Census  surveys  not  involving  verbal  statements,  e.g.,  sur- 
veyors measuring  the  acreage  of  an  area,  and  bookkeepers  or  accountants 
determining  the  amount  .of  a designated  type  of  expenditure;  (2)  Census 
surveys  involving  verbal  statements,  e.g.,  the  standard  job  of  the  U.S. 
Census;  (3)  Sampling  surveys  not  involving  verbal  statements,  e.g.,  the 
job  of  the  Food  and  Drug  Administration  in  examining  shipments  of 
merchandise,  a survey  of  car  license  plates  of  patrons  to  determine  their 
geographic  distribution,* * 4  and  certain  phases  .of  accounting  work;5  and 
(4)  Sampling  surveys  involving  verbal  statements,  e.g.,  public  opinion 
polls  and  similar  interviewing  operations. 

The  major  legal  difficulties  are  compounded  in  surveys  of  type  (4). 
These  public  opinion  poll  surveys,  therefore,  will  form  the  center  of  this 
discussion. 

The  law  has  not  yet  developed  a general  rule  for  the  treatment  of  all 
survey  evidence,  although  it  is  recognized  that  only  two  types  of  surveys 
are  clearly  acceptable.  Survey  evidence  produced  by  the  U.S.  Census, 
although  it  is  hearsay  and  frequently  based  on  sampling  rather  than 
"census”  operations,  is  dearly  admissible.  The  other  type  of  survey  is 
admissible,  not  because  of  its  distinguished  authorship,  but  because  of 
the  nature  of  the  collected  information.  If  the  interview  answers  are  not 


arc  comments  by  two  distinguished  social  scientists,  Paul  F.  Lararsfeld  and  Samuel  A. 

Stouffer. 

4 This  serves  to  establish  the  competitive  position  between  retail  outlets,  or  motion 
picture  theatres.  See  note  73  infra, 

5 See  infra  p.  329. 
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gathered  “for  the  truth  of  the  matter  asserted  therein”  the  survey  will 
be  accommodated  under  the  state-of-mlnd  exception  to  the  hearsay 
rule.® 

Surveys  which  do  not  fall  into  either  of  these  two  categories  can  gain 
admittance  only  under  the  broader,  less  stringent  approach  (of  which 
the  statc-of-mind  exception  is  but  a special  case)  which  weighs  the 
necessity  of  utilizing  the  evidence  against  the  dangers  of  hearsay.  Un- 
less, therefore,  a survey  exception  to  the  hearsay  rule  emerges  at  some 
future  time,  the  admissibility  of  survey  evidence,  as  a rule,  will  hinge 
upon  the  assessment  of  its  worth 

In  two  recent  cases,  the  sharp  dividing  line  between  statc-of-mind 
surveys  and  other  surveys  was  softened.  In  one,  a state-of-mind  survey 
was  admitted  because  there  seemed  to  be  sufficient  safeguards  for  ad- 
mitting that  particular  hearsay  evidence.* 7 *  In  the  other,  a survey  was 
admitted  “for  the  truth  of  the  matter  asserted”  in  spite  of  the  fact  that 
it  was  made,  because  the  hearsay  dangers  were  found  to  be  negligible 
compared  with  the  value  of  the  promised  information.® 

The  privileged  status  which  the  law  accords  to  the  operations  of  the 
United  States  Census  provides  an  appropriate  introduction  for  our  dis- 
cussion because  here,  in  a curious  way,  the  law  has  solved,  expressly  or 
by  implication,  all  the  major  perplexities  created  by  surveys  and  sampling 
operations.  Thereafter,  we  will  consider  the  problems  of  sampling  and 
hearsay  as  they  arise  with  respect  to  other  surveys.  Our  discussion  will 
suggest  that  the  true  difficulties  with  survey  evidence  do  not  arise  from 
its  being  based  on  sampling  operations  or  on  hearsay.  Rather,  these 
factors  will  emerge  as  the  legal  justifications  for  avoiding  difficulties  that 
can  probably  be  met  more  directly. 

The  United  States  Census 

Courts  have  not  hesitated  to  take  judicial  notice  of  Census  data,  and 
some  state  statutes  make  such  data  competent  “prima  facie  evidence” 
as  a matter  of  law.9  This  is  so  although  all  Census  data  are  technically 
h'^rsay,  transferred  as  they  are  from  the  original  respondent  through  a 
long  chain  of  staff  workers.  Furthermore,  much  Census  data  are  based 
on  sample  operations  rather  than  a complete  enumeration  of  all  units. 

One  explanation  for  the  fact  that  the  sampling  issut  has  never  been 
raised  in  connection  with  Census  data  is,  perhaps,  that  so  few  people 
know  that  many  Census  data  ate  based  only  on  samples.  For  instance, 

• See  note  37  infra. 

7 See  note  39  infra. 

4 See  note  40  infra. 

9 5 Wigmore,  Evidence  S 1671  n.  11  (3rd  ed.  1940). 
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the  Census  income  distribution  for  the  United  States  population  and 
its  subgroups  is  based  on  the  questioning  of  every  fifth  household  head. 
Similarly,  such  household  data  as  the  “proportion  of  homes  having 
mechanical  refrigeration”  are  based  on  replies  from  inhabitants  in  every 
fifth  dwelling  unit.  Yet  the  Census  Bureau  considers  these  findings  not 
significantly  less  reliable  than  actual  “census”  data.  It  is  an  obvious 
waste  of  resources  to  ask  a question  of  all  respondents,  if  questioning 
every  fifth  will  yield  results  of  sufficient,  if  not  greater,  accuracy.10 

Even  if  the  sampling  nature  of  some  Census  data  were  better  known, 
it  is  improbable  that  it  would  be  used  to  impeach  their  accuracy.  This 
may  be  inferred  from  the  emphatic  court  endorsement  of  the  counting 
procedures  of  an  official  institution  of  much  lesser  standing.  With  respect 
to  a sampling  examination  conducted  by  the  Federal  State  Seed  Lab- 
oratory in  Alabama,  a court  said: 

Reports  which  are  of  a public  nature  and  taken  under  competent  au- 
thority to  ascertain  a matter  of  public  interest  are  admissible  in  evidence 
against  all  the  world.11 

The  survey  operations  of  the  Census  have  still  another  privilege  vital 
to  the  success  of  any  continuous  interviewing  operation.  By  statute,  the 
information  given  to  the  Census  enumerator  is  treated  as  privileged, 
which  makes  it  impossible  to  verify  the  obtained  information  by  calling 
interviewees  as  witnesses  in  court. 

For  survey  operations  not  conducted  by  the  U.S.  Census  or  similar 
public  bodies,  all  three  objections — that  they  are  based  on  samples,  that 
they  constitute  hearsay,  and  that  there  is  a reluctance  to  abandon  the 
1 anonymity  of  their  interviewees — provide  serious  obstacles  for  the  law. 
! Ore  must  ask,  therefore,  why  a status  given  to  the  U.S.  Census  is  being 
withheld  from  other  survey  operations?  Whatever  the  legal  formula,  the 
true  answer  must  lie  in  the  confidence  which  the  Census  survey  peculiarly 
commands.  This  confidence  derives  from  the  disinterested  character  of 
the  Census  operation  and  from  the  trust  in  its  expertness.12  It  is  sus- 
tained by  the  great  simplicity  and  the  factual  character  of  most  Census 
i questions,  which  leave  little  room  for  error.1* 


10  See  iote  25  infra. 

11  E.  K.  Hardison  Seed  Co.  v.  Jones,  149  F.2d  252,  257  (6th  Cir,  1945). 

12  Compare,  however,  this  passage  in  Judge  Caffey’s  opinion  in  the  Alcoa  case: 

...  I conceive  of  no  reason  for  discriminating  between  piivale  individuals  and  Gov- 
ernment employees  in  formulating  . . . what  constitutes  inadmissible  hearsay. 

United  States  v.  Aluminum  Co.  of  America,  35  F.  Supp.  320  at  $25  (SJD.N.Y,  1940). 

13  Still  another  sample  survey  which  is  admitted  almost  daily  without  much  legal  argu* 
ment  in  our  courts,  and  probably  without  awareness  that  it  is  a sample  survey  is  the 
mortality  (able,  submitted  as  proof  of  the  average  life  expectancy  at  any  given  age, 
subject  to  modification  by  special  trials  of  the  particular  individual. 
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Most  surveys  are  based  on  samples  because,  practically  speaking,  there  j 
is  no  other  way  of  conducting  them.  There  are  even  a few  situations  j 
where  a complete  census  enumeration  is  not  merely  impracticable  but  j 
impossible  because  the  census  would  destroy  the  entire  group  to  be 
measured.  This  is  inevitable  whenever  the  examination  involves  the 
destruction  of  the  examined  unit,  as  in  most  laboratory  examinations  of 
food  and  drugs.14  Similarly,  sampling  naturally  recommends  itself  when- 
ever the  universe  (the  technical  term  for  the  group  from  which  a sample 
is  taken)  is  either  physically  unavailable  or  remains  undetermined  as  to 
its  ultimate  size.  Estimating  the  value  of  burned  timber  is  an  example 
of  the  first;18  estimating  the  potential  Joss  of  bus  fare  from  a projected 
change-over  to  one-way  traffic  is  illustrative  of  the  second.18 

The  most  frequent  and,  in  the  long  run,  most  important  advantage  of 
samples  is  their  overwhelming  economy  as  compared  to  the  often  pro- 
hibitive costs  of  the  census  operation.  While  most  universes  would  permit 
a census  survey  in  theory,  its  costs,  in  terms  of  money  and  time,  as  a 
rule,  are  prohibitive.  In  an  early  attempt  to  approach  something  like  a 
census  survey  in  a Federal  Trade  Commission  case,  one  of  the  commis- 
sioners voiced  this  eloquent  protest: 

I want  to  register  my  protest  at  the  way  in  which  this  case  was  conducted. 
About  a thousand  witnesses  . . . were  permitted  to  testify  as  to  whether 
the  use  of  the  word  ‘Castile’  when  applied  to  a soap  not  made  exclusively 
of  olive  oil,  had  the  tendency  to  deceive  the  public  ....  [T] his  piling  up 
of  cumulative  evidence  is  an  inexcusable  outrage  on  the  public  ....  The 
attorneys  and  the  trial  examiner  traveled  throughout  the  country' for  the 
purpose  of  taking  the  testimony  of  such  witnesses.  About  700  such  were 
subpoenaed  to  testify  at  Spokane  ....  [Testimony  has  caused  the  F.T.C. 
to  waste  hundreds  of  thousands  of  dollars.17 


Judge  Wyzanski,  in  the  S/toe  Machinery  Case,  notes  that  “If  anti-trust 
trials  are  to  be  kept  manageable,  samples  must  be  used.”18 


M . „ . [a]  test (s)  . . . results  in  the  articled  being  rendered  useless  ....  [I Inspection 
and  condemnation  on  the  basis  of  samples  tested  is  clearly  contemplated  by  tie  Act 
[21  U.S.CA.  334(c) J. 

United  States  v.  43  and  one-balf  Gross  Rubber  Prophylactics!  65  F.  Supp.  534,  536  (D. 
Minn.  1946). 

[The  witness]  took  four  sample  plots  in  each  40  acre  subdivision  and  estimated 
the  destroyed  merchantable  timber  . . . . We  are  not  convinced  that  any  improper 
elements  entered  into  the  estimates  of  the  witness.  . . . 

Feather  River  Lumber  Co.  v.  United  States,  30  FJd  642,  643  (9th  Cir.  1929). 

The  Court  accepted  a sample  survey  and  the  opinion  evidence  of  an  expert  witness  on 
the  ground  that  it  was  "dictated  by  necessity,  there  being  no  other  way  to  establish  the 
probable  loss  of  fares.”  Eighth  Avenue  Coach  Corp.  v.  City  of  New  York,  170  Misc.  243, 
251,  10  N.Y.S  2d  170,  IS1  (Sup.  Ct.  N.Y.  County  1939),  afi’d,  2S6  N.Y.  84,  35  N.E.2d  907 


(1941). 

IT  From  Commissioner  Humphrey's  dissent  in  Matter  of  James  S.  Kirk  & Co.,  12  F.T.C. 
272.  289  (1928). 

is  United  States  v.  United  Shoe  Machinery  Corp.,  110  F.  Supp.  295,  305  (D.  Mass. 
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Occasionally,  a situation  may  arise  where  the  universe  to  be  examined 
is  finite  and  small  enough  so  that  the  law,  by  insisting  upon  complete 
enumeration,  does  not  impose  an  impossible  burden  on  the  litigant.  In 
a California  case,  involving  suit  for  overpayment  of  a local  sales  tax, 
plaintiff  offered  an  estimate  of  the  overpayment  based  upon  a sample, 
but  the  court  insisted  upon  an  introduction  of  the  complete  count.18 

The  courts  generally  are  familiar  with  the  fact  that  a sample  must  be 
representative  of  its  universe.  To  state  precisely  what  is  meant  by  a 
representative  sample,  we  might  start  by  recalling  a most  famous  instance 
of  what  was  not  a representative  sample — the  more  than  two-million 
ballots  of  the  1936  Literary  Digest  poll  which  forecast  Landon’s  victory 
over  Roosevelt.  By  relying  on  the  voluntary  response  from  people  listed 
in  telephone  directories,  owners  of  automobile  licenses,  and  its  own  sub' 
scribers,  the  Literary  Digest  sample  WHder*represented  those  social  strata 
which  were  to  sweep  Roosevelt  into  his  second  term.  A representative 
sample,  then,  is  one  in  which  all  members  of  the  group  have  an  equal 
chance  of  being  selected.  Such  a sample  will  represent  all  subgroups 
more  or  less  proportionately.20 

The  only  way  to  insure  an  adequate  representation  is  by  some  form  of 
lottery.  A lottery,  with  all  members  of  the  universe  susceptible  of  selec- 
tion, is  at  the  core  of  every  sampling  operation.  The  so-called  systematic 
sample,  whereby  every  nth  item  or  name  from  a list  is  selected,  is  but  a 
hidden  form  of  lottery.  It  is  implied  that  these  »th  units  are  in  fact 
random  selections.21  The  many  variants  and  problems  in  sample  design 
do  not  so  much  derive  from  the  basic  and  easily  solved  problem  of  se- 


1953),  aff’d,  347  TJ.S.  521  (1954).  Compare  also  Ihe  strong  endorsement  of  survey  evidence 
! in  "streamlining  the  Big  Case”  Report  of  Special  Comm,  of  the  Section  of  Anti-Trust  Law, 
Am,  Bar.  Assoc.,  Sept.  15,  1958,  p.  31. 

j 19  Sears  Roebuck  & Co.  v.  The  City  of  Inglewood,  reported  in  Sprowls,  "The  Admis- 
sibility of  Sample  Data  into  a Court  of  Law,”  4 U.C.L.A  L.  Rev.  222-32  (1957).  See  also 
note  34  infra. 

20  Sometimes,  however,  if  one  wants  to  measure  characteristics  of  a relatively  small  sub- 
group  it  will  be  necessary  to  over-represent  that  sub-group  in  the  sample  (over-sampling) 
so  as  to  have  a sufficiently  large  cross-section  of  the  sub-group.  If  or  wanted  to  com- 
pare, for  instance,  households  with  and  without  automobiles,  one  might  sample  every 
non-owner  but  only  every  fourth  owner,  so  as  to  have  equally  sized  samples  of  both  groups. 

21  Judge  Wyzanskl  used  a s?raple  of  similar  design: 

} At  the  Court’s  suggestion,  the  Government  took  and  offered  in  addition  to  the  OMP.'s 
[Outside  Machine"  Reports]  depositions  {a  sample]  of  45  shoe  manufacturers  operat- 
i mg  55  factories.  The  Court  arbitrarily  selected  from  a standard  directory  of  shoe 
‘ manufacturers,  the  first  15  names  that  began  with  Oie  first  letter  of  the  alphabet,  the 
first  15  names  that  began  with  the  eleventh  letter  of  the  alphabet,  all  8 of  the  names 
that  began  with  the  twenty-first  letter  of  the  alphabet,  and  the  first  seven  of  the 
n2mes  that  began  with  the  twenty-second  letter  of  the  alphabet.  This  sample  covers  3 
per  cent  of  *be  sbee  manufacturers.  Th*  sample  includes  small  and  large  factories, 
and  concerns  manufacturing  shoes  according  to  substantially  the  most  popular  shoe 
manufacturing  processes. 

United  States  v.  Uniltd  Shoe  Machinery  Corp.,  110  F.  Supp.  295,  305.  (D.  Mass.  1933). 

Such  clustering,  however,  is  not  without  danger  since  it  may  coincide  with  natural 
clusters  on  the  Ust,  such  as  the  names  beginning  with  "Me”  or  with  "Rosen." 
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lecting  a representative  sample,23  but  from  the  secondary  difficulty  of 
selecting  a sample  with  maximum  efficiency,  that  is,  with  minimum  costs. 

Lim  itations  Upon  the  Accuracy  of  Sample  Measurements 

The  price  to  be  paid  for  accepting  a sample  instead  of  a census  enu- 
meration is  a certain  degree  of  uncertainty.  This  might  seem  a shocking 
admission  in  an  argument  for  the  acceptance  of  samplirg  evidence.  But 
this  shortcoming,  as  we  shall  see,  is  mitigated,  first,  by  the  fact  that  the 
degree  of  uncertainty  can  be  reduced  to  any  desired  magnitude  (though 
never  to  zero);  second,  because  the  law  is  rarely  in  a position  to  insist  on 
certainty;23  and  third,  and  most  important,  by  the  fact  that  census  opera- 
tions, too,  are  open  to  error.  Experts  insist  that  a sample  survey,  in  spite 
of  its  inherent  uncertainty,  can,  at  times,  be  more  accurate  than  the  cor- 
responding census  operation.24 

Let  us  first  clarify  the  nature  of  this  uncertainty  in  sample  measure- 
ments. Suppose  we  found  that  68  per  cent  of  the  housewives  in  a sample 
had  bought  at  the  XX  food  store  chain.  What  can  we  say  about  the 
corresponding  percentage  in  the  universe  from  which  this  sample  was 
drawn?  Ignoring  the  technical  niceties,  the  relationship  is  expressed  as 
follows:  A range  is  set,  extending  in  both  directions  around  the  sample 
measurement,  e.g.,  from  62  to  74  per  cent,  also  written  as  68  ± 6 per 
cent.  The  probability  that  the  tiue  (universe)  measure,  if  determined  by 
a census,  will  fall  within  this  range  is  then  determined.  If  in  this  par- 
ticular instance  the  sample  measurement  of  68  per  cent  was  derived 
from  a sample  of  5,400  housewives,  the  odds  will  be  99  to  1 that  the 

S2  The  Federal  Seed  Act  Regulations,  7 C.F.R.  IS  201.208-19,  and  especially  $3  201.210- 
13  (1959)  contain  explicit  sampling  instructions  and  thereby  acknowledge  this  underlying 
principle. 

23  . . . [Although,  admittedly  such  summaries  arc  not,  md  could  not  be,  absolutely 
complete,  they  are  the  most  accurate  and  most  dependable  data  in  existence  to  establish 
this  fact. 

Hughes  Too!  Co.  v.  Ford,  114  F.  Supp.  325,  542  (ED.  Okla.  1953). 

24  The  paradox  is  easily  explained.  Census  operations  are,  by  definition,  gigantic  tasks, 
necessitating  the  hiring  of  a great  staff,  the  quality  of  which  eludes  control.  It  is  suggested 
that  population  sampling  surveys,  in  certain  respects,  would  produce  more  accurate  results 
than  the  population  ‘census/  Some  of  the  Census  inaccuracies  are  well  known:  Reported 
age  distributions  have  a slight  tendency  to  bulge  at  the  round  (10  year)  intervals;  transient 
(hotel,  etc.)  populations  are  underrated;  babies  under  1 year  similarly  are  underrated— at 
the  expense  of  babies  who  are  reported  as  having  reached  their  first  year. 

For  the  array  of  the  survey  problems  thi  Bureau  of  the  Census  is  confronted  with,  com- 
pare Hauser,  “Labor  Force  and  Gainful  Workers — Concept,  Measurement,  and  Compar- 
ability,1" 54  J.  Sociology  333  (1949). 

Compare,  also,  the  following  reference  from  a case  before  the  Illinois  Commerce 
Commission: 

The  Company’s  evidence  with  respect  to  depreciation  . . . was  based  primarily  upon 
a sample  of  the  Company’s  property  ....  It  was  pointed  out  that  the  use  of  a sample 
tends  toward  greater  accuracy  in  the  final  conclusions,  inasmuch  as  a reasonably  in- 
tensive inspection  of  the  entire  plant  is  virtually  impossible  considering  the  limitation 
of  time  and  money  that  should  reasonably  be  imposed. 

Illinois  Bell  Telephone  Co,  7 P.U.R.3d  41,  606  III.  Commerce  Comm’n  493,  506  (1955). 
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statement  is  correct  that  the  true  universe  percentage  will  fall  within 
the  indicated  limit.  Given  a certain  sample,  the  odds  will  be  lower  where 
the  narrower  limits  of  the  range  are  posted.  In  our  example,  the  odds 
will  be  about  19  to  1 that  one  is  correct  in  asserting  that  the  true  value 
will  fall  within  68  ± 4 per  cent,  and  only  2 to  1 that  they  will  fall  within 
68  ± 2 per  cent,  i.e.,  between  66  and  70  per  cent. 

A special  case  of  this  estimating  procedure  is  acceptance  sampling,  a 
customary  practice  in  quality  control  of  production  and  accounting.  The 
task  is  to  determine,  by  means  of  a sample  and  in  terms  of  agreed-upon 
odds,  whether  the  proportion  of  faulty  items  in  the  universe  does  or  does 
not  exceed  a specified  tolerable  minimum.  If  no  faulty  item  is  tolerable, 
sampling  must,  of  course,  never  be  resorted  to,  as  that  item  might  be 
hidden  in  the  non-sampled  part  of  the  universe.25  But  if  some  small  pro- 
portion of  faulty  items  is  tolerable,  however  small,  a sample  can  be 
designed  to  assure  confidence  odds  for  such  a margin.2* 

This  situation  requires  the  survey  planner  to  determine  the  necessary 
accuracy  of  the  sample  measurement  and  to  design  his  sample  accord- 
ingly. The  accuracy  with  which  a sample  result  can  be  projected  depends, 
as' a rule,  on  two  factors:  the  absolute  size  of  the  sample  and  the  variance 
of  the  property  to  be  measured.  The  widely  held  belief  that  the  accuracy 
of  a sample  is  connected  with  its  relative  size  to  the  universe  is  mis- 
taken.27 A sample  smaller  than  1 per  cent,  taken  from  one  universe,  can 
be  much  more  reliable  than  one  comprising  10  per  cent  of  another.  To 
determine  with  equal  accuracy  the  average  age  of  the  population  of  New 
York  City  and  of  Peoria,  Illinois  will  require  samples  of  equal  size. 

But  if  the  size  of  the  universe  is  immaterial,  its  variance  is  of  great 
importance.  Assume  we  wanted  to  determine  the  mean  age  of  two  groups 
of  people  and  w:anted  to  be  sure  of  certain  minimum  odds  that  the  sample 


25  To  think  otherwise  would  be  like  the  person  who,  in  counting  a bundle  said  to  con- 
tain 1000  one  dollar  notes,  stops  satisfied  al  996  and  exclaims:  “If  it  was  all  right  up  to 
now,  the  rest  will  be  al!  right  too.” 

Sampling,  as  a legitimate  part  of  the  accountant’s  practice,  is  acknowledged  In  Ultra- 
marcs  Corp.  v.  Touche,  255  N.Y.  170,  191.92,  174  N.E.  441,  449  (1931). 

The  defendants  [charged  with  having  falsely  certified  a balance  sheet)  attempt  to 
excuse  the  omission  of  an  inspection  of  the  invoices  proved  to  be  fictitious  by  in- 
voking a practice  known  as  that  of  testing  and  sampling.  A random  choice  of  ac- 
counts Is  made  from  the  total  number  on  the  books,  and  these,  if  found  to  be  regular 
...  are  taken  as  a fair  indication  of  the  quality  of  the  mass.  . . . Verification  by  . . . 
sample  was  very  likely  a sufficient  audit  as  to  accounts  regularly  entered  upon  the 
books  in  the  usual  course  of  business. 

The  court  goes  on  to  say  that  the  sample  in  the  present  case  did  not  excuse  the  omission, 
not  because  sampling  in  itself  is  improper,  but  only  because  the  crucial  entry  in  the  ledger 
of  assets,  plainly  interpolated  and  suspicious  on  its  face,  was  excluded  from  the  sample. 
For  technical  details  on  sampling  in  accounting  work,  see  Vance  and  Netter,  Statistical 
Sampling  for  Auditors  and  Accountants  (1956). 

21  Only  in  those  rare  cases  where  the  sample  becomes  a sizable  proportion  of  the 
universe,  say  20  per  cent  or  more,  do  adjustments  for  approaching  the  size  of  the  universe 
need  to  be  made  in  the  formula. 
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result  be  not  more  than  3 years  oft  the  true  mean  age  of  the  sampled 
group.  Suppose,  further,  that  one  group  to  be  sampled  Is  the  general 
population  of  a city  and  the  second  is  the  residents  in  all  the  old  age 
homes  of  that  city.  Since  age  in  the  general  population  varies  from  zero 
to  approximately  one-hundred,  the  general  population  will,  for  the  same 
degree  of  accuracy,  require  a larger  sample  than  the  old  age  home  popu- 
lation, whose  age  varies  only  between  seventy  and  one-hundred. 

The  inverse  relationship  between  variance  and  sample  size  is  most 
visible  in  the  sampling  of  well  mixed  liquids;  a drop  of  blood  will  provide 
an  accurate  count  of  the  blood  cells  per  volume,  because  this  percentage 
varies  little  from  drop  to  drop.  There  is,  indeed,  no  more  homogenous 
universe  than  a well  mixed  liquid.  Itence,  even  a very  small  sample 
might  yield  an  accurate  measurement.  Yet  the  following  generalization 
is  not  warranted: 

(One  of  the]  prerequisites  necessary  to  the  admission  in  evidence  of 
samples  [is]  that  the  mass  should  be  substantially  uniform  with  reference 
to  the  quality  in  question.  . . .** 

The  court  errs  here,  for  also  a universe  with  low  homogeneity  permits 
sample  measurements  of  great  accuracy;  It  merely  requires  a corres- 
pondingly larger  sample.  Uniformity  or  homogeneity  or  variance — the 
three  terms  are  used  here  interchangeably — is  a matter  of  degree  and 
can  be  compensated  by  appropriate  sample  design  and  size. 

Two  points  emerge;  first,  that  the  degree  of  accuracy  of  any  sample 
can  be  Increased  to  the  desired  limit;  and  scccn  I,  that 'the  degree  of 
accuracy  with  which  the  sample  is  related  to  its  corresponding  value  in 
the  universe  can  be  stated  with  precision.  Expctt  knowledge  of  the 
sampling  error  Is  as  unambiguous  as  the  knowledge  of  the  relationship 
between  a circle's  radius  and  its  circumference. 

The  Sampling  Error  and  the  Law 

Sample  measurements  raise  interesting  legal  issues.  It  might  seem  dis- 
concerting that  no  sample  measurement  can  be  stated  with  complete  con- 
fidence in  its  accuracy.  But,  as  we  pointed  out,  the  law  is  accustomed  to 
dealing  with  less  than  perfect  evidence.  Both  standards  of  proof,  “rea- 
sonable doubt”  and  "preponderance  of  evidence,”  allow  for  imperfections. 
Rather,  It  is  the  positive  aspect  of  sample  measurements  that  raises  new 
issues,  f.e.,  the  possibility  of  actually  measuring  the  degree  of  uncertainty 
or  certainty  through  tolerance  limits  for  the  measurement  at  issue. 

One  way  in  which  the  law  might  deal  with  the  problem  is  to  accept  the 


t*  E.  K.  Hirdbofi  Seed  Co.  r.  Joses  *****  11  it  J5J. 
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range  of  the  sampling  error  because  any  value  within  its  range  would 
falf.ll  the  immediate  legal  requirements: 

For  the  evidence  is  all  in  one  direction  ....  Different  methods  of 
weighting  the  various  factors  would  produce  different  results;  but  no 
reasonable,  qualified  person  would,  by  any  rational  process,  reach  a figure 
outside  that  range;  and  probably  most  methods  would  reach  a figure  close 
to  the  middled  it.  Moreover,  even  though  this  high  figure  is  not  attained 
in  every  part  of  the  market,  nonetheless  the  figure  may  be  fairly  used 
since  United  supplies  in  every  significant  generic  class  of  shoe  machinery, 
except  machinery  used  in  the  cement  process;  and  in  rubber  shoe  manufac- 
turing, and  of  course,  excepting  dry  thread  sewing  machinery,  far  more 
than  50%  of  the  demand.  In  short,  it  is  not  inaccurate  in  this  market  to 
say  United  has  a 75-95 share;  and  it  probably  would  be  accurate  to  say 
an  approximately  85%  share.** 

Or  the  result  of  the  measurement,  however  inaccurate,  may  clearly  fall 
short  of  the  legal  requirement: 

. . . (P)laintiff  did  produce  witnesses  who  testified  as  to  confusion  .... 
However,  considering  that  annually* *  more  than  70,000,000,  rolls  of  plain- 
tiff’s . . . mints  are  sold,  to  say  nothing  of  many  millions  of  packages  of 
other  flavors  ...  it  would  be  extraordinary  if  some  confusion  could  not  be 
found  irrespective  of  the  details  of  the  dress  of  the  package  ....  A new 
competitor  is  not  held  to  the  obligations  of  an  insurer  against  all  possible 
confusion.*0 

In  these  cases  the  court  found  the  magnitude  of  the  sample  measurement 
so  clearly  above  or  below  the  relevant  legal  limits  that  it  deemed  it 
unnecessary  to  put  numerical  values  on  these  limits.  In  the  following 
opinion,  however,  full  use  was  made  of  the  possibility  of  measuring  the 
accuracy  of  a sample: 

The  Commission  retained  (an  expert]  ...  to  anatvee  the  Company’s 
sample  of  its  plant  to  determine  . . . whether  it  was  fairly  representative  of 
the  plant  as  a whole.  He  testified  that  in  his  opinion  the  sample  was  fairty 
drawn  and  that  the  chance  of  it  varying  as  much  as  1%  from  the  plant 
as  a whole  was  negligible.  We  conclude  that  the  Company’s  sample  shoutd 
be  accepted  as  a fair  cross-section  of  the  plant  as  a whole.** 

The  courts  could  go  further  and  specify  standards  for  sample  measure- 
ments since  the  sampling  error  can  be  reduced  at  will  by  enlarging  the 
sample.  To  be  sure,  there  is  a serious  cost  element  to  be  noted.  Since  the 
sire  of  the  sample  error  is  inversely  related  to  the  square  of  the  sample 
site,  the  site  of  the  sample  must  be  quadrupled  to  reduce  a sampling 
error  to  one-half  of  its  original  site. 

An  interesting  solution  was  proposed  in  the  California  case  in  which 

*♦  Jodi*  WyliwlJ  in  United  Slitei  v.  United  She*  Miehinery  Ccep-i  mpei  not*  71  it 
SO?. 

M |_ift  Siveri  Ccep.  v.  Curti$i  Cindy  Co-  tS?  F.7d  4,  t (?tl>  Cie.  I9W). 

*i  UUncii  Bell  Telethon*  Co,  Jvpei  not*  >4. 
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suit  was  instituted  for  overpayment  of  a local  sales  tax.  Plaintiff  offered 
an  estimate  of  the  overpayment  based  on  a sample  survey  as  follows: 
$28,250  with  2 to  1 odds  that  the  true  (census)  value  would  fall  between 
$23,150  and  $30,350,  and  19  to  1 odds  that  it  would  fall  between  $23,950 
and  $32,450.  Plaintiff  voluntarily  reduced  his  claim  to  $27,000,  thereby 
suggesting  his  willingness  to  absorb  the  greater  part  of  the  sampling 
error.  The  court  insisted  on  a complete  census  count,  however,  only  to 
discover  that  it  yielded  $26,750,  a sum  within  1 per  cent  of  the  claimed 
amount.** 1 

In  most  situations,  however,  the  universe  to  be  measured  will  be  so 
large  that  the  court  will  not  have  the  choice  between  a sample  measure- 
ment and  a census,  but  rather,  would  either  have  to  accept  a sample 
measurement  or  have  no  measurement  at  all.  In  the  latter  situation  the 
relatively  inaccurate  sample  measurement  may  provide  better  evidence 
than  the  law  now  possesses.  That  a sample  value  may  be  inaccurate  by 
some  small  margin  of  error  could  be  of  less  importance  than  the  fact  that  it 
can  provide  a measurement  as  accurate  as  it  does.  This  possibility  be- 
comes particularly  Important  when  the  measurement  refers  to  the  core 
of  the  litigated  Issue,  and  hence  becomes  a measure  of  the  soundness  of 
the  court’s  judgment  or  verdict.  Two  examples  may  be  cited. 

The  owner  of  a gambling  establishment,  at  his  trial**  for  income  tax 
evasion,  contended  that  his  lottery  wheel  had  retained,  over  the  years  in 
question,  some  11  per  cent  of  the  placed  bets.  The  prosecution  introduced 
expert  evidence  to  the  effect  that  the  laws  of  prooability  suggest  that  the 
wheel  (with  its  specific  game  rules)  must  earn,  in  the  long  run,  some 
22  per  cent  of  the  gross  intake,  that  Is,  about  twice  the  amount  for  which 
the  defendant  argued.  The  prosecution  might  have  gone  one  interesting 
step  further  and  offered  evidence  in  the  following  form:  “The  odds  that 
the  defendant’s  statement  Is  true — that  the  wheel  earned  not  more  than 
II  per  cent — are  one  in  a thousand”  (or  whatever  the  odds  were,  com- 
puted from  the  rules  of  the  game  and  the  actual  number  and  size  of  the 
placed  bets). 

The  use  of  blood  tests  In  paternity  suits  provides  another  example,  it 
appears  that,  because  of  the  possibility  of  accidental  mutations,  their 
evidentiary  vatue  Is  riot  completely  foolproof.  On  the  average,  it  is  esti- 
mated that  the  blood  test  may  err,  by  excluding  the  actual  father  from 
parenthood,  In  one  birth  In  10,000.  Because  the  paternity  cases  which 

1*  Sears  Roebuck  It  Co.  v.  The  Citr  of  Inglewood,  wpn  note  If.  Tbe  Issue  posed  Li 
Dial  uw  could  pr oxide  a more  tcnerai  rationale  for  deciding  the  degree  of  accuracy  that 
ihoald  bt  retired  of  a sample.  The  answer  could  come  from  asking  these  l to  questions: 
(t)  How  much  wouM  it  tost  to  increase  the  accuracy  of  the  sample  to  a specified  limit? 

(I)  VVhal  would  be  the  d t!ar  equivalent  el  this  gain  in  accuracy  for  the  L«sue  at  band? 

M United  States  v.  Sanders  Scott,  IIS  (tvb  Cir.)  (unpabUshtd), 
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reach  the  courts  are  more  likely  to  be  among  the  few  where  the  test  fails, 
it  is  estimated  that  the  test  would  lead  the  court  into  error  about  1 time 
in  a hundred.  The  question  arises,  therefore,  whether,  iu  view  of  this 
fallibility,  the  blood  test  can  siill  be  accepted  as  incontrovertible  proof. 
The  Danish  jurist  who  drew  attention  to  this  particular  aspect  of  the 
problem  suggests  that  blood  tests  be  accepted  as  such  proof.  To  permit 
their  impeachment  by  witness  testimony  could  only  increase  a ratio  of 
error  that  is  otherwise  kept  to  1 in  100.*4 

The  Hearsay  Barrier 

Since  opinion  surveys  reflect  statements  made  by  third  persons  to  an 
ink r viewer  who,  in  turn,  has  related  them  to  the  analyst  who  may  appear 
as  an  expert  witness  in  court,  such  evidence  is  clearly  hearsay.  But  the 
courts  have  developed  a line  of  authority  which  exempts  surveys  from 
the  hearsay  rule  if  the  declarations  are  reported  not  “for  the  truth  of  the 
matter  asserted  therein,”  but  as  expressions  of  the  interviewee’s  state-of* 
mind.  Confusion  of  names  and  trade-marks  provide  the  leading  examples: 

The  hearsay  objection  is  unfounded  . . Hhe  statements  oi  the 
persons  interviewed  were  not  offered  for  the  truthfulness  of  their  assertions 
. . . (TJhey  were  offered  solely  to  show  as  a fact  the  reaction  of  . . . the 
public.  . . . Only  the  credibility  of  those  who  took  the  statements  was 
involved,  and  they  were  before  the  court.** 

It  is  true  that  some  of  the  hearsay  dangers  are  reduced  by  an  approach 
of  this  kind;  but  they  are  by  no  means  absent.  Thus,  while  one  may  be 
inclined  to  welcome  any  opening  for  the  acceptance  of  survey  evidence, 
one  cannot  wholeheartedly  endorse  this  particular  one. 

The  state-obmind  doctrine  has  two  disadvantages.  It  may  make  the 
courts  overlook  technical  pitfalls  which  are  more  likely  to  occur  in  state- 
of-mind  surveys  than  in  other  types  of  surveys.  For  example,  the  degree 
of  confusion  of  two  trade-marks  Is,  as  a rule,  the  sum  of  two  forms  of 
confusion:  the  specific  confusion  of  the  two  trade-marks  in  question  plus 
the  general  confi  sion  that  will  obtain  for  any  comparison  In  an  average 
group  of  people  under  most  test  conditions.  Clearly,  unless  the  degree 
of  specific  confusion  is  great,  allowance  must  be  made  for  the  general. 

**  Rest,  "The  V>K’5  of  Blood  Tests  it  Evidence  In  PiUrrity  Cues,"  )l  llarv.  L.  Rev. 

453  (105$).  lie  adds,  lienifiomljv  that  I be  tourlt  in  Dtnroirk  ire  inclined  to  accept 
the  bloodiest  exclusion  is  ibvelute  poof  with  rtspect  to  children  torn  out  of  xrtdiocY* 
but  sho*  greater  reluctance  in  cites  mbrre  lb*  paternity  of  kgitimte  tkDdrcn  ft  it  to it. 

**  United  States  v.  M Ci«*,  I?)  F.;d  96).  974  (3rd  Cir.  1051),  cert,  denkd,  341  VS.  «1 
(1051).  See  also  People  v.  Franklin  Nit!  Ban*,  TOO  Mix.  SSI,  105  N.YSjd  fcl  (Sup.  Ct. 
K t»n  County  1951),  ItvVf,  J05  .V  Y.  453.  IIS  StU  ?96  0951),  revM  34)  US.  3)3 
(1054);  Household  Fir.  nee  Ccrp.  v.  Federal  Finance  Cc.’p,  105  F.  Supp.  H4  (D.  Aril, 
1053).  While  public  epiftkfi  pollt  become  almost  standard  procedure  in  tridc-m.uk  con- 
fusion cases,  curiously  enough,  they  have  never  bten  wed  to  prove  (or  disprove)  the 
user  lion  that  a t/tde  Dime  had  become  i generic  tem. 


Hans  Zeisel 


i 

t 

1 


The  more  serious  difficulty  with  the  state-of-mind  rule,  however  is  j 
that  it  provides  no  basis  for  admitting  surveys  of  facts . In  the  latter 
type  survey,  the  hearsay  danger  is  less  than  in  surveys  of  statcs-of-mind. 
Surveys  which  ask  the  respondents  whether  they  own  a gas  stove,  or 
carry  a certain  brand  of  merchandise  in  their  store,  or  have  reached  a 
certain  age,  are  not  admissible  under  this  hearsay  exception.  Surveys 
asking  for  complicated  psychological  reactions  arc  covered.  Judge  Wy- 
ranski  has  noted  this  problem  and  has  decided  against  the  mechanical 
application  of  the  statc-of-mind  ru^  After  reviewing  the  positions  and 
citing  authorities  for  each,8*  he  r iudes: 

. . • So  long  as  the  interviewees  are  not  cross-examined,  there  is  no  test- 
ing of  their  sincerity,  narrative  ability,  perception,  and  memory.  There  is 
no  showing  whether  they  wcie  influenced  by  leading  questions,  the  en- 
vironment in  which  questions  were  asked,  or  the  personality  of  the  investi- 
gator. But  where  a court  is  persuaded  that  in  a particular  case  all  these 
risks  have  been  minimized,  thet  the  answers  given  by  the  interviewees  arc, 
on  the  whole,  likely  to  be  reliable  indicia  of  their  states  of  mind,  that  the 
absence  of  cross-examination  is  not  prejudicial,  and  that  other  ways  of  j 

getting  evidence  on  the  same  point  are  either  impractical  or  burdensome,  j 

the  testimony  sho  Id  be  admitted.  „ ♦ , In  this  case  these  conditions  have  i 

been  met.  Accordingly,  the  hearsay  objection  is  overruled  and  the  test!-  \ 

mony  of  the  results  of  the  poll  is  admitted.  . . .,T  ■ 

While  this  position  considerably  narrows  the  state-of-mind  exception  ap-  j 
plicabk  to  survey  evidence,  it  must,  in  the  long  run,  prove  sound.  j 

Another  case  which  provides  an  opening  wedge  (or  surveys  which  can-  j 
not  be  accommodated  under  the  staie-o(*mind  exception  is  United  Stoles  ! 
v.  Aluminum  Company  of  America.**  There,  an  expert  testified  to  an 
analysis  of  the  records  of  605  test  holes  drilled  to  determine  the  presence 
and  quality  of  bauxite  deposits,  although  the  persons  who  did  the  drilling 
and  prepared  the  reports  were  not  present  as  witnesses: 

Opinion  testimony  by  an  acceptable  expert  testing  wholly  or  partly  on 
information,  oral  or  documentary,  recited  by  him  as  gathered  from  others, 
which  is  trustworlny  and  which  is  practically  unobtainable  by  other  means, 
is  competent  even  though  the  first  hand  sources  from  which  the  informa- 
tion came  be  not  produced  in  v.'-urt.” 


t»  Some  authorities  hive,  therefore,  concluded  that  the  testimony  fi  not  heamy. 
United  States  v.  Ji  Cases,  IS?  Fid  967  tJd  Cir.  1951);  6 Wifmore,  Lvidence  (Sd 
ed)  | l)?6;  Note  64  Haiv.  L.  Rev.  495,  SOI,  SOS,  note  S4.  Others,  however,  hive 
noted  tint  the  proffered  evidence  bn  kvt  cf  the  dancers  of  hearsay.  See  Note  66 
Marv.  L.  Rev.  495,  $01-507:  Morein,  Heirsiy  Darken  and  the  Application  ot  the 
Hearsay  Concept,  61  Huv.  L,  Rev.  177,  IS5,  202-20S,  206;  MeCormkk,  Tie  Border- 
land of  Hearsay,  19  Yale  L J.  459,  491. 

Aitxrkatft  Work?  v.  United  $u?es  Trunk  Co  , 153  F.  Supp,  50,  53  (D.  Mur  l$5?) 

»»  IS*  F.  Supp  at  51. 

M 15  F.  Supp.  520  (S  DN*  Y.  1940). 

**  td  at  *75. 
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The  court  cites  as  precedent  the  opinion  of  Judge  Learned  Hand  in  G.  & 
C.  Mcrriam  Co.  v.  Syndicate  Publishing  Company:*0 

With  rcjpect  to  the  matter,  in  wliat  impresses  me  as  unambiguous  au- 
thoritative judicial  language,  it  has  been  said  that  'the  requisites  of  an 
exception  of  the  hearsay  rule’  are  ‘necessity  and  circumstantial  guaranty 
of  trustworthiness.’ 

The  court  goes  on  to  clarify  the  term  necessity: 

In  effect,  . . . necessity  ...  is  not  to  be  interpreted  as  uniformly  de- 
manding a showing  of  total  inaccessibility  of  firsthand  evidence  ...  but 
that  necessity  exists  where  otherwise  great  practical  inconvenience  would 
be  experienced  in  making  the  desired  proof.  . . .4I 

The  opinion,  standing  alone,  is  limited.  The  information  in  question 
was  not  gathered  through  interviews;  hence  it  involves  only  one  level  of 
hearsay;  moreover,  it  had  been  gathered  in  the  "ordinary  course  of 
business"  and  only  later  used  for  purposes  of  litigation.  Xor  does  the 
decision  -efer  directly  to  surveys  but  to  expert  testimony  informed  by 
a survey.  It  could,  however,  be  extended  to  the  survey  itself.41 
Let  us  now  consider  mere  closely  th?  dangers  of  insincerity,  faulty 
j narration,  perception  and  memory  as  they  pertain  to  survey  evidence. 

But  let  us  be  sure  to  see  the  problem  in  its  precise  form:  the  issue  is  not 
f,  whether  the  reliability  of  interview  response  would  increase  if  all  inter- 
j viewees  could  be  examined  as  witnesses  in  court,  since  this  is  not  an 
1 available  alternative.  As  a rule,  it  is  not  possible  to  bring  the  universe 
| or  its  truly  representative  sample  into  court.  The  customary  procedure 
is  to  call  a number  of  public  witnesses  who  allegedly  are  representative  of 
, the  universe.  But  a distinguished  lawyer  with  broad  experience  in  this 
j field  had  this  to  say  about  such  a procedure:  "The  poisonous  feature  of 
' the  public  witness  matter  is  . . . that  all  too  frequently  they  are  selected 
| not  Impartially  but  because  they  will  testify  t.ie  way  the  party  selecting 
j them  wants  them  to  testify.’’41  The  very  fact  that  such  witnesses  are 
| arbitrarily  selected  should  render  their  testimony  less  credible.  Even  if 
j these  public  witnesses  would,  In  fact,  give  a more  reliable  response  in 
court  than  to  a survey  interviewer,  their  evidence  should  be  rejected  on 


«•  20  F«t.  SIS,  SI*  (2d  Ctf.  IMS). 

4*  Supra  note  J!  it  IJJ‘14. 

41  ...  II Jt  may  very  well  turn  cut  to  be  tkal  when  tbe  *drr.t<?fNHiy  of  m vtt  evt* 
tJriKf  is  finally  tstalUsbed  beyond  doubt,  tbe  rationale  will  be  taken  from  Ike  fieM  of 
eapert  testimony. 

Vwiftbwry,  •‘Opinion  Survey'  in  Civil  Litigation.*  44  Trademark  Rep.  343,  363  (1954). 

° Id.  at  341  WtUrbury  discuss**  this  practice  in  detail.  He  notu  its  apy^arance  b Stinky 
Laboratories,  frrf.  v.  FTC,  1J*  t td  JftS  <*tb  Cit.  1*43),  and  Rook  oMbe. Monlb  Club, 

I Tit.  v.  rrct  203  FJd  415  (2d  Cif.  M3). 
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the  ground  that  these  individuals  do  not  adequately  represent  the  uni-  i 
verse,44  no  matter  how  many  of  them  are  called.45  | 

Assuming  it  were  possible  to  bring  a truly  representative  sample  of 
public  witnesses  into  court,  it  is  doubtful  that  their  answers  would  always  | 

be  more  reliable  than  those  given  to  a survey  interviewer.  The  inter-  j! 

viewee  is  in  no  way  connected  with  the  litigants,  not  even  through  the  1 

tenuous  bonds  created  by  being  a witness  for  one  side.  Moreover,  the  J 

interviewee  will,  as  a rule,  not  learn  the  purpose  for  which  his  response 
is  used.  In  a proper  survey  routine,  to  prevent  inadvertent  disclosure,  not 
even  the  interviewer  is  told  of  the  survey’s  purpose.4*  In  addition,  since 
the  survey  necessarily  precedes  the  trial,  less  time  will  have  elapsed  j 

between  the  response  and  the  event  to  be  recalled,  than  between  the  J1 

event  and  its  deposition  at  the  trial.  Finally,  the  court  has  before  it  the 
complete  and  uniform  question  schedule  in  response  to  which  the  survey  ^ 

results  were  obtained.  Court  witnesses,  on  the  other  hand,  at  times  under-  | 

go  careful  individual  preparation  prior  to  trial,  the  form  of  which  does  I 
not  necessarily  come  to  the  court’s  knowledge.  j 

To  be  sure,  court  witnesses  may  have  a heightened  awareness  of  what  J 

b at  issue  and  may  be  more  careful  and  perhaps  more  perceptive  than  i 

survey  respondents  who  are  completely  unaware  of  the  ultimate  issues.4* 
Cross-examination,  too,  may  prove  its  value  at  any  time  that  recollection  ' 
or  narration  proves  faulty.  But  even  if  some  of  the  individual  survey 
responses  are  not,  in  fact,  as  totally  accurate,44  the  group  measurement 
may  still  be  sufficiently  accurate  within  set  tolerance  limits.  ' i; 

In  summary,  therefore,  the  advantages  offered  by  survey  responses 
should  at  least  suffice  to  protect  such  evidence  from  outright  disqualifica- 
tion as  hearsay.  Moreover,  the  questions  propounded  In  many  cases  will 


41  Th«  univrrst,  of  touts*,  rnigbl  b«  >4  small  md  »cc«sibl«  ts  to 
court  of  every  one  of  ill  member*. 

W Hence,  it  is  wrong  to  believe  that  the  evfl  of  the  procedure  could  be  cured  by  fling 
ft  sufficient  cumber  of  wfcftcssc*  rt.  • . except  by  calling  as  witnesse*  so  many  of  the  public 
as  to  fender  the  ti?k  impracticable."  People  v.  FrjnkHn  Natl  Bank,  supra  note  J5  at  $6$. 
A bidlv  selected  sample  only  become*  worse  a*  It  become*  larger-  See  note  64  Infra. 

44  *lThe)  witness**  were  not  infotne  ! of  the  purpose  of  thi*  employment  [a  *boppitg 
survey).  . * . Always  two  went  together  so  that  there  were  two  witnesses  to  each  sale." 
Oneida,  Ltd.  v.  National  Silver  Co,  N.YSId  271,  2$6  (Sup  Ct  Madison  County  1540)* 
tl  ^Puithase*  of  merchandise  are  not  mide  In  a vacuum  with  P.ofessor  Quii  in  charge." 
Quaker  Oat*  Co  v.  General  Mills  Inc,  134  P 2d  425,  43J  (?th  Cir  1943), 

Hence,  Judge  Atruld’s  concern  in  New  York  Life  Ins.  Co.  v.  Taylof,  142  F2d  29?, 
J04  (PC  Cir  19 M)  Is  not  quite  to  the  point: 

A corporation  ts  engiged  in  tikim  a nationwide  poTl  ft*  to  the  number  of  member* 
of  the  Communut  Party.  In  the  regular  course  of  that  bj<ine<s  * . . the  Inter- 
viewer  report*  that  X,  Y and  Z are  Communist*,  giving  excerpts  irom  the  conversa- 
tions to  Jupporl  thu  opinion.  The  report  would  (not!  be  admissible  ...  to  make 
a prima  facie  cik  that  X,  Y and  Z are  Comm  mistj  . . . Such  evidence  might  be 
u*ed  h a p^rcecdir^  for  the  cancellation  of  a nalutaluitWft  certificate. 

To  be  sure,  if  the  unreliability  of  the  individual  response  pisses  a certain  point,  It  may 
Invalidate  a survey. 
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be  so  simple,  straight-forward  and  unambiguous  that  the  hearsay  dangers 
must,  in  fact,  be  negligible.  Hence,  survey  evidence,  if  properly  procured," 
is  well  covered  by  Wigmore’s  formulation  of  the  rationale  that  underlies 
all  exceptions  to  the  hearsay  rule:  “Where  circumstances  are  such  that  a 
sincere  and  accurate  statement  would  naturally  be  uttered,  and  no  plan 
of  falsification  be  formed.  . . .m* * 

Survey  Interviewees  as  Witnesses 

Some  court  decisions  suggest  that  a right  of  tbs  adversary  to  call 
survey  interviewees  as  witnesses  might  help  to  overcome  the  hearsay 
obstacle  by  fortifying  the  reliability  of  the  survey  evidence®0  Although 
in  some  cases  this  procedure  has  proved  feasible,  and  might  seem  ad- 
visable in  others,  there  are  strong  reasons  against  making  it  a general 
requirement  for  the  acceptance  of  surveys.  Cross-examination  of  selected 
survey  interviewees  is  likely  to  be  misleading.  It  may  lend  the  aura  of 
reliability  to  an  incompetent  survey,  or  it  may  destroy  confidence  in  a 
survey  which  deserves  better.  Careful  studies  have  shown  that,  on  re- 
interviewing,  one  always  finds  re*pondents  who  charge  their  original 
response.51  These  studies  have  also  shown,  however,  that  such  changes, 
as  a rule,  do  not  affect  the  reliability  of  the  survey.  This  paradox  requires 
explaining  although  such  explanation  might,  in  fact,  encourage  the  vety 
practice  against  which  it  is  aimed. 

A change  of  response  may  be  due  to  any  number  of  causes,  c.g.,  simple 
response  error  during  either  the  first  or  second  interview.  Even  on  such 
factual  items  as  agj,  a small  but  definite  group  of  respondents,  on  re- 
interviewing, will  give  a different  response.  Moreover,  changes  may  occur 
which  reflect  opinions  and  attitudes  which,  in  fact,  may  have  been  altered 
between  the  two  interviews.  Finally,  the  very  fact  of  having  been  pre- 
viously interviewed,  may  precipitate  the  change.  The  artificial  stimulus 
of  the  interview  may  generate  subsequent  conversations  or  inquiries  on 
the  part  of  the  respondent  which,  in  turn,  will  affect  the  original  uncon- 
laminatcd  attitude.  A subpoena  and  what  follows  may  prove  even  more 

11  I Wigmcre,  Eikioite  I 142 1 ()d  cd.  1940). 

to  *.  . . nor  my  of  the  persons  allegedly  later  vkwtd  were  called  a*  witnesses.  . * . 
General  Dry  Billeries,  Inc.  v.  Ray-O-Vat  Co^  45  Tradrmirk  Rep  5f$.  594  (1955)— "Any 
Information  he  lihe  field  representative!  could  give  on  Ike  witness  Hind,  would  . . . bavt 
amounted  to  beamy  bawd  upon  bears*)/’  Irvin  v.  Slit*,  $6  So.  2d  25$,  *91  (Fit.  1955). 
The  assurance  that  the  tntervxwee*  could  be  oiled  as  witnesses  wu  made  a condition  of 
\\t  tdmi«<kn  of  the  survey  in  Fverilt  Hat  Company  v.  Sckum  Hat  Company,  MihraakH 
County,  Circuit  Court,  July  11,  1955  (Wise  ),  unreported  opinion,  quoted  in  Barksdale, 
supta  note  5.  Tbe  defendmt  did  not  attempt  to  foilify  its  Survey  through  any  such 

* finesses."  Oneida  v.  National  Stfvtf  Co„  supta  note  46  at  117.  Defendant  subpoenaed  11 
witnesses  who  had  signed  the  surrey.  Quaker  Oats  Co.  v.  General  M3U,  l»ci  supta  note  41 
at  UK 

H Kendall,  Conflict  afrd  Mood;  Factors  Affecting  Stability  of  Response  (1954), 
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disturbing;  before  the  cross-examination  is  finished,  so  many  influences 
will  have  become  operative  since  the  first  interview,  that  a discovered 
discrepancy  would  prove  little  or  nothing.  In  such  a case  all  the  weight 
would  have  to  be  accorded  to  the  first  response. 

One  might  argue  that  there  is  some  value  in  cross-examining  a re- 
spondent as  to  the  circ  .nistances  of  his  being  questioned  and  the  mode 
of  recording  his  answers.  But  these  same  circumstances  can  also  be 
elicited  from  a cross-examination  of  the  interviewers. 

The  discovery  of  discrepancies  on  the  individual  level  need  not  in- 
validate the  survey  results,  for  while  some  interviewees  may  switch  from 
position  a to  b,  an  approximately  equal  number  will  switch  from  position 
b to  a.  The  difficulty  lies  in  the  fact  that  if  such  occasional  individual 
changes  are  revealed  in  court  through  cross-examina'ion,  it  may  tend  to 
impeach  the  survey  because  the  over-all  compensating  effect  cannot  be 
shown.” 

There  is  still  another  reason  against  encouraging  this  practice.  Unless 
the  courts  protect  the  survey  Interviewees  from  subsequently  being  called 
as  trial  witnesses,  bona  fide  surveys  for  purposes  of  settling  legal  issues 
will  become  more  and  more  difficult  to  conduct.  The  problem  is  en- 
gendered not  so  much  by  the  potentially  large  number  of  such  witnesses, 
but  primarily  by  a peculiar  condition  under  which  surveys  operate.  In- 
terviewers of  private  survey  organizations  are  finding  open  doors  and 
willing  respondents  because  they  are  scrupulously  observing  a canon 
which  they  share  with  the  Census  interviewer.  They  will  not  voluntarily 
identify  individual  answers  by  exposing  their  respondent.  This  assurance 
Is  given  either  explicitly  or  implicitly  by  all  reputable  survey  organiza- 
tions. Their  operations  would  come  to  a halt  If  It  were  known  that  an 
interviewee  might  have  to  pay  for  his  cooperation  by  being  called  into 
court  and  there  exposed  to  the  doubtful  pleasures  of  cross-examination. 
The  law  specifically  prohibits  any  disclosure  of  the  answers  of  the  re- 
spondent in  Census  surveys  in  order  to  insure  truthful  response  and  to 
avoid  embarrassment.  No  other  survey  response  enjoys  tins  protection. 
A private  survey  organization  cannot  assure  its  interviewees  of  confi- 
dential treatment  If  the  court  orders  the  production  of  Its  survey  ques- 
tionnaires.11 The  knowledge  of  this  threat  keeps  many  a survey  organiza- 
tion from  accepting  such  legal  work. 


•*  In  tun  of  major  importance,  tit  survey  organirathft  tniy  atternrt  to  document  tils 
by  re-intervicwing  part  of  t>«  respondents  end  tosnparing  tie  paired  results. 

tl  Tbt  bsue  arose  before  an  PCC  examiner  concerning  tie  sale  of  station  WGMS.  It  also 
arose  during  the  bearings  before  an  examinet  for  the  U S.  Department  of  Libor  concerning 
a autvey  of  wages  paid  in  tbt  tteclrk  lamp  Industry. 
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There  is  a melhod  of  avoiding  this  dilemma.8*  The  identification  of  the 
survey  respondent  can  be  written  on  a perforated  section  of  the  ques- 
tionnaire and  separated  after  the  supervisor  has  passed  it  as  satisfactory. 
The  separated  names  will  form  a record  of  all  respondents,  but  since 
none  can  be  identified  with  any  specific  response,  they  can  be  cross- 
examined  only  as  to  whether  they  had  been  interviewed  at  all.  The  mu- 
tilation of  questionnaires,  of  course,  might  be  misinterpreted,  but  a proper 
explanation  should  eliminate  this  danger.85 

For  these  reasons,  then,  the  cross-examination  of  the  survey  respond- 
ents should  not  be  required  as  a rule.  In  the  vast  majority  of  cases  it 
will  not  aid  in  the  evaluation  of  the  survey  but  might,  on  the  contrary, 
only  confuse  the  issue.  Whether  the  law,  so  hesitant  to  extend  the  area 
of  privileged  communications,  will  soon  respond  to  these  needs  is  doubt- 
ful. 

The  Impeachment  of  Surveys 


The  discussion  thus  far  has  tended  to  establish  that  neither  the  fact 
that  a survey  is  based  on  a sample,  nor  that  it  relates  hearsay  evidence 
should,  in  itself,  bar  its  admission  into  evidence.  To  be  sure,  there  is  the 
danger  that  the  weakness  of  survey  evidence  may  be  hidden  under  a 
pretending  surface.  The  following  discussion  will  suggest  that  under  the 
guidance  of  expert  witnesses  any  such  defects  can  be  satisfactorily  ex- 
posed and,  hence,  most  danger  avoided.  The  courts  generally  have  re- 
fused admittance  to  surveys  in  which  they  have  discovered  technical 
flaws,  rather  than  admitting  them  and  permitting  their  impeachment.  Ad- 
ministrative agencies,  on  the  other  hand,  are  more  liberal  in  their  prac- 
tice, partly  because  they  ate  not  bound  by  the  ordinary  rules  of  evidence, 
and  partly  as  a result  of  their  familiarity  with  the  specific  technical  prob- 
lems before  them.  Moreover,  many  cases  before  these  agencies  are  of 
the  sort  that  can  hardly  be  decided  without  the  assistance  of  survey  evi- 
dence. 

A court  sitting  without  a jury  will  seldom  hesitate  to  admit  a survey 
in  evidence.  The  Supreme  Court  has  never  either  reversed  or  criticised  a 
trial  court  for  admitting  survey  evidence  in  a civil  case  tried  without  a 


»«  t.  «ai  fmplovrd  in  a mmv  bv  I he  Federal  Focd  and  Dm*  Administration  in  United 
State*  v.  355  cawi”*  Mountain  Mineral  Yatky  Water,  Civil  Xo  565,  ED.  Ark,  May  1956, 
as  In  a survey  deigned  for  an  FCC  faring,  the  interviews  were  concluded  with  the 


IcHowing  statement:  . , ..  .. 

A*  ycu  m y know,  reputable  survey  aptnoes  never  make  known  any  indinduali 
opinion  without  bis  pemissi© ft.  1 be  sponsor  of  Ibis  *u ivey  is  ut king  an  applkatka 
to  operate  the  mvke  we\e  been  diseussiftg  with  you.  Would  you  be  willing  to  havt 
your  name  referred  to  hi » in  connexion  with  the  opinions  you  have  Just  expressed  f 


This  approach  is  straightforward,  but  mas*  kad  to  self-sekxlkn  of  those  respondent*  who 
are  more  likely  to  stand  by  their  outspoken  opinion*.  The  result  must  be  a biased  group 


_ _ more  . 
of  witnesses  of  no  value  to  tbe  court. 
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jury”  The  court  may  indeed  admit  a survey  even  if  it  h?s  no  confidence 
in  its  probative  value: 

It  is  doubtful  that  such  an  exhibit  fa  survey  of  farm  machinery  outlets 
in  Iowa,  conducted  by  the  Statistical  Laboratory  of  Iowa  Stale  College] 
has  any  relevancy  ....  [B)ut  the  Court  concludes  that  there  is  no  need 
to  strike  the  exhibit.  It  may  remain  in  evidence  for  what  it  may  be 
worth.  However,  it  should  be  slated  that  the  conclusion  hereinafter  indi- 
cated . . . would  be  the  same  in  absence  of  this  documentary  evidence.*7 

When  sitting  with  juries,  however,  the  courts  prefer  to  exclude  surveys  if, 
on  preliminary  examination,  they  find  flaws  in  them,  rather  than  admit 
them  for  whatever  they  may  be  worth.  Where  the  line  between  exclusion 
and  admittance  ought  to  be  drawn  should  depend  upon  how  difficult  it  is 
effectively  to  impeach  a bad  survey. 

Improper  Universe 

There  are  three  critical  points  at  which  a survey  operation  can  fail  and 
provide  ground  for  its  impeachment.  First,  the  survey  may  have  been 
directed  at  a universe  which  is  irrelevant  to  the  litigated  issue.  In  such 
cases  of  obvious  error,  the  court  will  not  need  expert  advice. 

. . . (l]nlerviewer$  stopped  [the  respondents)  in  front  of  one  of  the  appel- 
lant’s stores  in  San  Francisco  and  asked  them  in  what  manner  they  spoke 
of  'Ltrner  Shops.’  Obviously  the  results  of  such  a survey  are  of  little 
value  in  determining  what  knowledge  residents  of  San  Jose  had  of  ‘Lerner 
Shops’  ...» 

Or, 

. . . (T]he  survey,  having  been  limited  to  retailer*.  Is  inadmissible  to 
show  that  in  the  market  of  ultimate  consumers  the  plaintiffs  design  had 
acquired  a secondary  meaning.** 

Incdequeie  Semple 

It  b not  always  obvious  that  a survey  reflects  an  improper  universe. 
Sometimes  it  purports  to  represent  the  correct  universe  but,  in  fact,  does 
not  do  so.  This  is  the  second  point  at  which  a survey  may  prove  in- 
adequate. The  universe  may  be  properly  selected,  but  the  sample  designed 
to  represent  it  may  be  faulty.  At  this  point  the  expert’s  help,  as  a rule, 
will  be  needed  to  explain  the  magnitude  of  unavoidable  flaws  or,  as  the 
case  may  be,  of  any  errors  in  sampling. 

The  quality  of  any  sampling  procedure  depends  both  on  its  basic 
design  and  its  execution.  The  expert  will  readily  discover  its  deficiencies 
from  the  report  itself,  supplemented  by  internal  documents  and  such 

L'nktd  Slim  v.  United  Sho*  Machinery  Cotp,  lup  i not*  21. 

» United  Statei  v.  1. 1.  Cim  Co,  JOt  F.  Supp.  8S4,  SW  (D.  Minn.  ISM). 

« Lerr.-r  SivtM  v.  Utf.tr,  1(2  F.Jd  ISO,  IS2  («d»  Cir.  P47). 

•*  Afctrkaa  L«;;t£e  Work*  t.  United  State*  Trunk  Co,  ISS  F.  Supp.  SO,  SI  (D. 
Man.  l»M|. 
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testimony  from  the  survey  staff  as  may  be  necessary.  Questions  directed 
at  discovering  the  manner  in  which  a respondent  was  selected  for  inter- 
viewing should  provide  all  the  information  an  expert  will  need.  The  sur- 
vey staff,  from  the  director  down  to  the  field  interviewers,  must  be  avail- 
able for  cross-examination.  While  cross-examination  of  all  interviewers 
should  be  avoided,  the  court  should  not  refuse  to  hear  as  many  as  are 
needed  to  clarify  the  exact  modalities  under  which  the  survey  was  con- 
ducted. 

To  detect  deviations  from  instruction  will  require  a more  detailed 
probing,  primarily  by  questioning  the  supervisory  staff  and  randomly 
selected  interviewers/0  Questions  should  be  directed  at  the  institutional 
safeguards  against  error  (substituting,  without  permission,  respondent  B 
for  respondent  A)  and  against  the  admittedly  rare  occurrence  of  faking 
parts  or  the  whole  of  an  interview.  These  safeguards  may  include  proper 
recruitment,  training,  and  supervision  of  the  field  staff,  as  well  as  spot 
controls  and  double  checks  of  the  particular  survey  sample. 

One  of  the  more  easily  overlooked  sampling  traps  may  arise  from 
what  fs  technically  called  non-response.  There  are  always  some  indi- 
viduals in  any  sample  from  whom  it  is  impossible  to  obtain  the  desired 
information,  either  because  they  could  not  be  located  (e.g.,  were  not  at 
home)  or  because  they  refused  to  answer  the  questions  asked  of  them. 
An  effort  to  measure  the  size  of  broadcast  audiences,  for  example,  must 
go  far  astray  if  it  bases  its  findings  only  upon  the  people  found  at  home. 
It  will  exaggerate  the  true  audience  because  such  people  are  more  likely 
to  listen  to  broadcasts  than  those  who  are  away  from  home  and,  hence, 
omitted  from  the  survey.  There  are  several  techniques  for  dealing  with 
this  difficulty,  all  aimed  at  an  estimate  of  how  the  non-respondents 
would  have  responded  had  they  been  reached  and  Interviewed. 

As  has  been  pointed  out,  however,  no  sample  is  ever  without  short- 
comings: the  exigencies  of  costs,  accidents,  and  other  circumstances 
may  escape  control  and  introduce  bias.  It  is  the  expert’s  preeminent  task 
to  enlighten  the  court  as  to  the  relevance  of  such  flaws  fn  respect  to  the 
measured  issues.  The  point  is  an  Important  one,  for  even  though  a sample 
have  many  flaws,  it  may  be  judged  sufficient  for  deciding  a particular 
issue.  The  surveys  of  the  late  Dr.  Kinsey  are  illustrative  although,  of 
course,  they  have  not  come  before  the  courts.  The  "Human  Male”  and 
"Female”  were  represented  only  by  those  111-.' sorted  men  and  *om«n 
who,  by  one  means  or  another,  could  be  persuaded  to  be  interviewed. 

There  is  a standard  method  of  estimating  the  survey's  true  value  from 

t*lt  b bid  ptitlke  to  permit  (be  tsrvry  crgininlkta  to  prwrtU  iti  but  lote’vfartf 
to  th«  resit;  i tutvtjr  ii  etdy  u toed  »» in  rrntnt  BnL 
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such  an  improperly  selected  sample.  First,  the  degree  of  under-  or  over-  l 
representation  of  certain  sub-groups  is  determined,  e.g.,  too  many  college 
educated,  not  enough  laborers.  One  then  estimates  what  the  over-ail, 
corrected  group  average  would  have  been  had  the  sub-gv  ups  been  repre- 
sented in  their  true,  known  proportions.91  While,  of  course,  such  estimates 
lack  the  precision  of  probability  samples,  they  will  often  be  satisfactory  1 
if  the  gioup  measure  clearly  falls  beyond  the  crucial  minimum  require- 
ment. But  more  often,  if  a sample  is  improperly  designed,  the  expert  will 
be  unable  to  appraise  the  size  of  its  bias.  J 

However,  even  a properly  designed  and  well  executed  sample  may 
prove  wanting,  simply  because  it  is  too  small  to  provide  the  desired 
response,  i.e.,  the  sampling  error  may  be  too  great.  This  difficulty  may  j 
derive  from  the  fact  that  a sample  can  be  sufficiently  large  to  answer  i 

some  questions,  but  too  small  to  answer  others.  Here,  again,  only  the  | 

expert  can  advise  the  court  with  precision.82  j: 

Circumstances  of  the  Interview  j 

The  third  point  at  which  the  validity  of  a survey  must  be  tested  is  at 
its  line  of  questioning  and  the  circumstances  under  which  the  interview 
was  conducted.  Lawyers  know  that  there  is  more  than  one  way  of  posing 
a question  and  that  the  response  in  each  case  may  be  different.  Such  dif- 
ferences may  result  from  the  phrasing  of  the  individual  questions,  from  j 
their  sequence,  or  from  the  questioning  situation.  There  is  a body  of 
experience  from  which  the  expert  will  be  able  to  guide  the  trier  of  facts. 

He  will  detect  bias  where  the  layman  sees  none,  he  will  know  where 
memory  failure  will  tend  to  underrate  and  where  vanity  may  have  the 
opposite  effect,  and  he  will  know  also  when,  at  times,  the  interviewer’s 
personal  opinion  affects  his  respondent’s  answers.83  ' ' ; 

Yet  it  is  axiomatic,  in  survey  technique,  that  the  danger  of  question  bias 
increases  with  the  complexity  and  ambiguity  of  the  questions.  If  their 
aim  is  simple  and  factual,  such  as  determining  the  make  of  the  re- 
spondent’s automobile,  neither  the  form  nor  the  sequence  of  the  questions 
will  make  much  difference.  But  in  the  survey  question  “As  a guess,  how 

81  An  example  of  this  procedure  can  be  found  In  Zeisel,  “Sexual  Behavior  In  the  Human 
Female,”  21  U.  Chi.  L.  Rev.  517,  519  (1954). 

It  might  be  remembered  that  thr  Issue  of  the  adequate  sample  size  becomes  relevant 
only  with  a good  sample.  As  the  Literary  Digest  experience  showed,  if  a sample  Ir  wrorwly 
designed  its  being  targe  makes  It  only  worse,  because  it  makes  it  less  probable  that  the 
systematic  bias  in  sampling  is  cancelled,  or  at  least  reduced,  by  the  sampling  error  workin; 
accidentally  In  the  opposite  direction. 

83  Hence,  the  requirement  that  the  interviewer  knew  as  little  about  the  purpose  of  the 
survey  as  Is  compatible  with  his  duties.  For  a complete  examination  of  all  such  Interview- 
ing problems,  compare  Hyman,  Interviewing  in  Social  Research  (I95t). 
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much  wax  «-ould  you  fay  there  was  in  glass  wax  . . . the  court  rightly 
found  “a  built-in  bias.”6* 

In  another  case  it  was  the  sequence  of  questions  which  was  found  to 
color  the  results.  At  issue  was  the  confusion  of  “Ail  State”  and  “All 
States  Life  Insurance”: 

i 

(3)  What  does  ‘All  State*  mean  to  you? 

\ (4)  If  you  wanted  All  Slate  Insurance  where  would  you  go? 

(5)  Have  you  ever  heard  of  All  States  Life  Insurance  Company? 

(6)  Who  would  you  tay  owns  All  States  Life  Insurance  Company? 

J The  court,  with  justification,  criticized  the  survey  for  “not  fahly  pre- 
dating the  name  All  State.”65 

( Other  aspects  of  an  interview  can  also  become  grounds  for  criticism. 
( Word  association  tests  given  to  students  in  a classroom  were  rejected 
because  their  reactions  v,Tere  “bound  to  differ  from  that  of  the  buyer  in 
the  market  place  when  confronted  with  the  . . . beverage.  . . ,”66  As  an- 
| other  court  remarked,  “the  issue  is  not  whether  the  goods  would  be  con- 
fused by  a Qasual  observer,  but  [rather]  ...  by  a prospective  purchaser 
j'  at  the  time  he  considered  making  the  purchase.  If  the  interviewee  is  not 
in  a buying  mood  but  is  just  in  a friendly  mood  answering  a pollster,  his 
degree  of  attention  is  quite  different.”67 
| A rather  subtle  source  of  bias  was  noted  by  a court65  in  the  selection, 
f as  the  inter  dewing  area,  of  “the  vicinity  of  Syracuse,  which  is  not  far 

i’  ...  .from  the  town  in  which  plaintiff’s  goods  are  manufactured,”  thus 

1 producing  an  abnormally  high  degree  of  confusion  with  his  brand. 
t’  Occasionally,  the  problem  at  issue  will  present  genuine  difficulties  to 
the  interviewee.  Such  an  issue  aro  ,e  at  a recent  heaving  before  an  FCC 
examiner  concerning  the  sale  of  radio  station  WGMS  in  Washington, 
D.C.  When  survey  results  purporting  to  show  the  audience’s  preference 
for  "classical”  and  “semi-classical  music”  were  presented,  it  was  con- 
tended that  the  respondents  did  not  understand  the  meaning  of  these 
p terms.65  The  problem  of  ambiguity  might  be  avoided  by  posing  a battery 

j’  of  questions,  each  of  which  would  cover  one  facet  of  the  ambiguous  con- 

1 ' cc-pt.70  Together  they  would  insure  that  the  interviewee  has  responded  to 

jj  all  aspects  of  the  concept. 

l!  M S.  c.  Johnson  & Son,  Inc.  v.  Grid  Seal  Co.,  230  F.2d  832  (D.C.  Cir.  19S6),  40 
. . Trademark  Rep.  347,  3S7-5S  (1950),  aff’d,  90  U.S.P.Q.  373  (1951). 

I '•’>  Scars  Roebuck  & Co.  v.  t"  Slates  Life  Ins.  Co.,  246  F.2d  161,  171,  172  (Sth  Ctr. 

j-  !9S1). 

; « Coca-Cola  v.  Nefci  Corp,  27  Del.  Ch.  318,  326,  36  A.2d  156  (1944). 

u Judge  Wyzanski  In  American  Luggage  Works  v.  United  States  Truok  Co.,  supra  note 
:!  37  at  53. 

f'il  Oneida,  Ltd.  v.  National  Silver  Co , supra  cote  46  at  286. 

[•'  52  Broadcast  Telecasting  Magazine  68,  February  11,  1957. 

, Compare  Lazarsfeld,  Comment  appended  to  the  Blum  and  Kalven  article,  supra  note  3. 
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The  hearsay  problem  also  may  arise  as  a result  of  survey  techniques. 
In  Orbo  Theatre  Corp.  v.  Loew’s,  Inc.,n  the  trial  court  refused  to  admit 
a survey  in  which  the  respondents  were  asked  about  their  movie  going 
habits  and  those  of  their  families.  This  added  level  of  hearsay  constituted 
one  of  the  barriers  for  admittance  of  the  survey. 

By  way  of  conclusion,  then,  we  may  say  that  a survey  can  err  in  at 
least  three  basic  ways: 

(1)  it  may  aim  at  an  irrelevant  universe; 

(2)  although  aiming  at  the  right  universe,  it  may  not  be  representative 
because  of  faulty  sampling,  or  it  may  be  based  on  too  small  a 
sample  and  hence  render  measurements  that  are  not  sufficiently 
precise; 

(3)  thi  mode  of  questioning,  the  interviewing  situation,  or  the  se- 
quence of  questions  may  tend  to  reflect  inaccurately  the  char- 
acteristics at  which  the  survey  aims. 

Experts  and  Procedural  Safeguards 

Courts  and  juries  are  presumed  capable  of  judgiug  the  value  of  wit- 
ness testimony.  They  are  trusted  to  determine  if  such  testimony  is  ir- 
relevant or  otherwise  defective  and  without  weight.  Because  of  a tend- 
ency to  present  survey  findings  in  a way  that  makes  them  appear  simple 
and  judgeable  by  the  laymen,  the  detection  of  defects  in  survey  evidence 
and  the  evaluation  of  its  '••.eight  and  significance  is  not  a simple  task. 
Bence',  no  survey  should  be  presented  without  accompanying  expert  tes- 
timony. The  expert  will  be  needed  precisely  because  of  the  .urface  sim- 
plicity which  almost  all  surveys,  good  or  bad,  tend  to  display.  Thought 
should  also  be  given  to  the  possibility  of  appointing  impartial  survey  ex- 
perts, either  by  agreement  of  the  litigants  or  by  choice  of  the  court.  They 
would  be  analogous  to  impartial  medical  experts.72 

Survey  evidence  has  still  another  peculiarity  which  should  guide  its 
use  in  court:  its  production  is  usually  very  costly  and  time-consuming. 
To  confront  the  adversary  with  a survey  only  at  the  time  of  trial  will 
almost  always  constitute  an  unfair  surprise.  Surveys,  in  this  respect, 
ought  to  be  treated  much  like  experiments  which,  in  purpose  and  mode 
of  analysis,  they  closely  resemble.”  Two  procedural  suggestions  offered 
by  Professor  McCormick  in  regard  to  experimental  evidence  can  apply 
without  modification  to  survey  evidence: 

” n 156  F.  Supp.  770  (D.D.C.  1*57). 

72  Sec  Chapter  11,  discussing  Impartial  Medical  Experts  in  Zeisel,  Kalven  and 
Buchhoh,  Delay  In  the  Court,  (1959);  see  also  Zeisel,  '‘The  New  York  Expert  Testimony 
Project:  Some  Reflections  on  Legal  Experiments,1 n 8 Stan.  L.  Rev.  7J0  (1956). 

73  McCormick,  Evidence  } 169  (1954);  Zebel,  Say  It  With  Figures  132  (4th  ed.  1957). 
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. . , [T]be  adversary  system  . . . must  be  modified  ...  by  a rule  of 
court  providing  first,  that  no  experiment  [survey]  shall  be  used  in  evi- 
dence unless  reasonable  notice  shall  have  been  given  the  adversary,  with 
an  opportunity  to  make  suggestions  as  to  planning  and  to  be  present  at  the 
test:  and  second,  empowering  the  court,  in  discretion,  on  application  of 
either  party,  .0  appoint  an  impartial  person  to  conduct  ...  an  experiment 
[survey].74 

Where  both  sides  can  agree  on  an  impartially  conducted  survey,  this 
would  have  the  added  advantage  of  dividing  the  survey  costs.* 7® 

Toward  a Survey  Exception 

The  law  with  respect  to  survey  evidence  is  still  far  from  settled  doc- 
trine. Thus  far,  the  development  has  been  guided  by  fears  that,  since 
most  surveys  are  hearsay  evidence,  a bad  one  might  too  easily  mislead 
the  trier  of  facts.  But  our  discussion  has  shown  that  while  the  dangers 
of  an  uncritically  received  survey  are  real  enough,  they  derive  not  from 
its  hearsay  character,  but  primarily  from  elements  easily  opened  to  ex- 
pert review.  If  such  expert  help  is  available  to  the  court  and  the  parties 
to  the  trial,  the  dangers  arising  from  the  admittance  of  survey  evidence 
are  much  smaller  than  is  reflected  by  the  ru'es  which  presently  govern 
their  admission.  These  dangers  will  become  negligible  if,  in  the  prepara- 
tion and  presentation  of  survey  evidence,  the  following  safeguards  are 
observed: 

(1)  All  sampling  plans,  instructions  to  field  workers,  questionnaires 
and  other  survey  instruments  ought  to  be  available  as  evidence 
of  its  design. 

(2)  The  survey  staff,  from  the  director  down  to  the  ultimate  field 
workers,  should  be  available  for  questioning  as  to  the  survey’s 
manner  of  execution.  The  survey  interviewees,  as  a rule,  ought 
not  to  be  required  to  testify. 

(3)  The  survey  evidence  should  be  presented  by  an  expert  witness. 

(4)  If  a survey  is  planned  during  the  course  of  the  litigation,  the 
court  should  explore  the  possibility  of  having  the  survey  con- 
ducted by  stipulation  of  parties  through  an  agreed-upon  or  court 
appointed  impartial  expert.  At  that  time,  such  technical  require- 
ments as  size  of  sample  and  other  specifications  could  also  be 
stipulated.  If  this  should  not  prove  feasible,  a litigant  intending 

74  McCormick,  Evidence  { 169,  p.  362  (1954). 

7®  • • . IWJhere  contending  parties  , , , can  agree  on  the  making  of  a survey  (which 
may  or  may  not  mean  agreeing  to  be  bound  by  the  results  of  the  survey),  many  it  not 
most  of  the  really  controversial  problems  about  survey-making  will  be  eliminated. 
Walerbury,  "Opinion  Surveys  in  Civil  Litigation,"  44  Trademark  Rep.  343,  361  (1954). 
See  also  Note,  66  Harv.  L.  Rev.  498  (1953).  v ' 
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to  offer  a survey  in  evidence  should  be  required  to  notify  his 
opponent  early  enough  to  enable  him  to  become  an  observer  in 
its  development.  If  the  survey  was  completed  prior  to  the  com- 
mencement of  the  litigation,  it  should  be  disclosed  to  the  ad- 
versary well  in  advance  of  the  trial. 

If  these  safeguards  are  provided,  the  court  should  be  satisfied  that  the 
evidential  value  of  the  survey  can  be  appraised  objectively;  hence,  nothing 
should  prevent  its  admission,  provided  it  is  relevant  to  the  litigated  issue. 

If  these  safeguards  are  not  provided,  the  court  ought  to  refuse  to  admit 
the  survey. 

While  the  law  might  ultimately  develop  in  this  direction,  its  present 
state  gives  only  small  encouragement.  A reading  of  the  Model  Code  of 
Evidence  and  the  proposed  Uniform  Rules  of  Evidence  shows  quite 
clearly  that  the  isolated  cases  in  which  survey  evidence  was  admitted 
are  far  from  developing  into  a settled  doctrine.  The  Uniform  Rules  do 
not  admit  surveys  unless  they  qualify  under  the  state-of-mind  exception 
or,  perhaps,  as  “entries  made  in  the  ordinary  course  of  a bu-iness,”  this 
exception  would,  at  best,  apply  only  to  surveys  made  prior  to,  and  un- 
connected with,  the  litigation.  However,  since  surveys  provide  the  best, 
if  not  the  only,  evidence  on  certain  issues,  and  since  expert  knowledge 
in  the  field  has  advanced  sufficiently  to  protect  the  trier  of  the  facts 
from  error,  the  law  may  well  lower  its  heavy  guard.  i 
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UNIVERSE  AND  SAMPLE  I I 


The  necessary  compromise  enforced  by  limited  resources 
which  Mueller  and  Schuessler  refer  to  in  their  discussion  of 
sampling  procedures  is  clearly  illustrated  in  the  following  reading. 
By  defining  his  universe  as  "all  individuals  injured  or  killed  in 
auto  accidents  during  a calendar  year,  " Conard  was  forced  to  use 
sampling  techniques.  However,  as  he  points  out,  "The  ideal 
sampling  source  . . . did  not  exist;  however,  three  potential 
sampling  sources  were  available.  These  were  insurance  company 
records,  hospital. reports,  and  police  accident  reports."  And,  as 
a consequence,  the  sample  drawn  would  necessarily  be  biased. 
Insurance  company  records  do  not  include  injured  non-insured 
persons,  hospital  records  exclude  individuals  treated  in  doctors' 
offices,  and  not  all  personal  injury  accidents  are  reported  to  the 
police. 


In  addition  to  this  problem  of  sampling  sources,  there  is 
another  aspect  of  the  reading  which  is  useful  to  the  student  of 
social  science  methodology.  The  author  takes  great  pains  to 


make  it  clear  how  he  drew  his  samples  and  collected  his  data, 
(In  fact,  his  methodological  narrative  is  a good  example  of  what 
Howard  Becker  called  a "natural  history.  ")  Conard  presents 
the  nature  of  his  procedures  at  each  stage  of  the  research,  and 
any  challenge  against  his  findings  can  be  directed  at  the 
methodological  structure  of  his  study  instead  of  trailing  off  into 
a vague  skepticism  aimed  at  the  social  sciences  in  general. 


From  Alfred  F,  Conard,  et  al. , Automobile  Accident 
Costs  and  Payments  (Ann  Arbor:  The  University  J. 
Michigan  Press,  1964)  pp.  9-11;  322-366. 
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The  study  was  born  of  a suspicion  that  the  handling  of  persona! 
injury  cases  is  among  the  most  critical  problems  facing  the  legal 
profession  today.  Clearly  a large  part  of  the  public  believes  that 
ambulance-chasing  and  outrageo-j;  fees  ore  commonplace.  The 
waiting  period  to  get  to  trial  in  many  major  cities  is  notorious. 
Even  legal  theory  is  showing  symptoms  of  malaise,  as  attacks  are 
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made  on  contributory  negligence,  damages  for  pain  and  suffering, 
and  exclusion  of  evidence  of  insurance. 

The  designers  of  this  study  did  rot  wish  to  contribute  to  the 
welter  of  opinions  on  legal  theory,  nor  even  to  add  to  the  studies 
of  client  procurement,  attorney  compensation,  and  trial  delays. 
* hey  conceived  the  notion  that  a new  start  should  be  nrdc  by 
studying  the  underlying  human  demand  whose  pressures  have 
bubbled  forth  in  the  form  of  a “fee  problem,”  a "delay  problem," 
and  other  "problems.”  This  human  demand  was  conceived  to  be 
the  desire  for  something  to  fill  the  trench  in  material  well-being 
which  is  gouged  by  a personal  injury. 

The  grand  design  was  to  discover  what  are  the  economic  losses 
from  injury,  and  what  is  being  done  to  repair  these  losses.  It  svas 
supposed  that  the  trail  would  lead  back  to  the  point  of  initiv \ 
curiosity — injury  litiga'icn;  but  it  might  lead  in  a good  many  other 
directions,  which  would  be  equally  instructive. 

Three  separate  methods  of  study  were  adopted.  The  first  method 
was  to  collect  and  analyze  national  statistics  on  programs  which 
would  presumably  come  to  the  aid  of  an  injury  victim.  The  results 
of  this  approach  constitute  Part  I of  this  report. 

The  second  method  was  a field  survey.  It  began  with  interviews 
with  persons  involved  in  personal  injury  automobile  accidents. 
Later,  thanks  to  an  additio.  al  grant  of  funds,  it  was  extended  to 
include  interrogation  of  lawyers  for  the  injury  victims,  lawyers 
for  defendants,  individual  defendants,  and  hospitals.  This  led  to 
the  heart  of  the  study,  which  is  the  survey  of  Michigan  automobile 
injuries,  reported  in  Part  II. 

The  third  method  was  an  inquiry  into  foreign  systems  for 
dealing  with  the  same  human  demand.  Foreign  laws,  lawyers, 
courts,  r.nd  insurance  companies  may  differ  from  their  American 
counterparts,  but  modern  foreign  countries  are  sure  to  have  the 
same  human  demands,  occasioned  by  accidental  injuries.  Inform- 
ants from  England,  France,  Sweden,  and  West  Germany  supplied 
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information  on  sources  of  reparation  for  automobile  injury  victims 
in  their  respective  countries'. 
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Survey  Methods:  A Description 
Introduction 

This  chapter  describes  the  research  design  developed  for  the 
Michigan  survey.  Each  of  the  independent  components  of  the  final 
design  is  presented  separately,  with  emphasis  on  the  underlying 
design  considerations,  sample  composition,  questionnaire  content, 
ar.i  response  and  weighting  characteristics  for  each.  These  dis- 
cussions are  preceded  by  a description  and  definition  of  the  group 
of  individuals  (universe)  being  studied,  as  well  as  an  overview  of 
the  tota1  survey  design.  The  present  chapter  also  describes  the 
’Veighung"  procedures  used  for  the  various  parts  of  the  study, 
the  procedure  used  to  substitute  plaintiffs  from  the  court  sample 
for  plaintiffs  from  the  police  sample,  and  the  procedure  used  to 
estimate  income  loss  for  sampled  individuals,  and  the  techniques 
used  to  collect  and  process  the  data. 

/ The  following  chapter  evaluates  the  research  design  described 

here,  placing  special  emphasis  on  the  extent  to  which  the  assump- 
tions and  design  specifications  could  have  introduced  serious  in- 
accuracies into  the  final  lesults. 

A.  The  Universe  Defined 

111*  universe  (the  ratal  group  of  persons  being  studied)  in- 
cluded all  individuals  injured  of  killed  in  automobile  accidents 
that  occurred  in  the  State  of  Michigan  during  one  calendar  year,  as 
well  as  non  injured  drivers  and  owners  involved  in  these  accidents. 
The  latter  were  inch  Jed  so  that  the  estimate  of  the  total  economic 
loss  resulting  from  personal -injury  automobile  accidents  would 
Include  an  accurate  valuation  of  property  damage  which  might 
hate  been  compensated  along  with  the  personal  injuries.  To  state 
this  another  way,  the  sample  for  this  study  was  selected  so  that 
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inferences  could  be  made  about  various  characteristics  of  all  per- 
sonal-injury accidents  occurring  in  Michigan  during  one  calendar 
year.  A calendar  year,  or  sonic  multiple  thereof,  was  desirable  in 
view  of  the  fact  that  available  Michigan  and  national  traffic  acci- 
dent and  court  statistics  arc  collected  and  published  on  a calendar 
year  basis.  The  final  sample  is  a statewide  area  probability  sample 
which  is  representative  of  the  universe  of  individuals  as  defined 
above. 


D.  Survey  Design:  An  Overview 

The  survey  design  for  this  study  includes  two  principal  samples, 
one  from  police  files  and  the  other  from  court  records.  The  first, 
which  provides  the  basic  sampling  frame  for  the  entire  study,  is  a 
probability  sample  of  automobile  accidents  which  occurred  in 
Michigan  during  19*8,  and  which  were  reported  to  either  the 
Detroit  Police  Department  or  the  Michigan  State  Police  as  having 
resulted  in  injury  or  death  to  one  or  more  persons.  This  sample 
co»  'ts  of  1118  accidents,  which  resulted  in  2872  individuals 
being  listed  on  police  accident  reports.  The  unit  of  analysis  for 
most  of  the  data  presented  in  the  report  is  the  injured  individual, 
not  the  accident. 

Summary  data  concerning  economic  losses,  injuries,  and  legal 
actions  taken  by  these  individuals  wete  collected  using  a mail 
questionnaire,  with  a mail  and  telephone  follow-up  for  those  not 
responding. 

The  returns  were  grouped  into  three  categories.  The  first  cate- 
goty  included  individuals  for  whom  the  completed  questionnaire 
provided  adequate  data  for  analysis;  i.e.,  in  terms  of  the  purposes 
set  forth  for  the  study,  it  was  felt  that  no  additional  data  would  be 
requited  from  these  individuals.  Persons  in  this  first  category  had 
medical  expenses  of  less  than  $500  and  incurred  no  permanent 
physical  disability  or  permanent  impairment  of  ability  to  work. 

The  second  category  included  individuals  who  cither  died  as  a 
result  of  the  accident  or  who  sustained  complicated  or  "serious” 
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personal  injuries— usually  involving  large  medical  expet.se,  a 
permanent  physical  disability  or  both.  For  most  of  these  individ- 
uals and  their  families,  the  accident  marked  a major  turning  point 
in  their  lives.  Most  of  these  individuals  required  extensive  medical 
treatment,  often  followed  by  job  retraining  or  rehabilitation.  Seri- 
ous financial  difficulties  were  frequently  encountered.  For  such 
persons,  information  provided  by  the  mail  questionnaire  was  not 
sufficiently  detailed  to  petmit  a complete  analysis  of  the  accident 
and  its  consequences.  Accordingly,  these  individuals  were  desig- 
nated to  be  teintetviewed  with  a more  detailed  personal-interview 
schedule. 

The  third  category  of  mail  questionnaires  consisted  of  those 
individuals  who  had  retained  a lawyer  and  who  were  involved  in 
some  kind  of  legal  action  at  the  time  the  questionnaire  was  com- 
pleted. Since,  at  the  time  the  respondents  were  asked  to  complete 
the  mail  questionnaires,  the  accidents  being  asked  about  were,  on 
the  average,  only  two  years  old,  many  of  the  legal  actions  were  just 
being  initiated,  and  very  few  had  been  completed.  Persons  involved 
in  th'  se  legal  actions  would  not  be  in  ft  position  to  give  complete 
cost  and  compensation  data  concerning  their  accidents  until  the 
cases  had  been  settled.  In  fact,  most  of  these  persons  had  been 
instructed  by  their  lawyers  or  insurance  companies  not  to  discuss 
the  case  with  anyone.  Since  many  of  the  cases  would  remain  in 
litigation  fot  another  two  or  three  years,  it  was  decided  to  drop 
this  group  entirely  and  replace  it  with  an  independent  sample  of 
older  ejtomobile  personal-injury  cases,  the  majority  of  which 
would  be  settled. 

In  order  to  fill  the  gap  created  by  this  elimination  and  also  to 
secure  a broader  sample  of  litigated  cases,  the  second  of  the  two 
principal  samples  for  this  study  was  used.  It  was  a probability 
sample  of  automobile  jersonal-injury  suits  filed  in  1957  on  the 
calendar  of  either  a Michigan  Circuit  Court,  the  Kent  County 
Superior  Court,  or  a Federal  District  Court  located  in  Michigan. 
Plaintiffs  in  the  sampled  cases  wctc  used  to  represent  individuals 
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in  the  original  police  sample  who  were  plaintiffs  in  a court  action. 
The  exact  procedure  used  to  substitute  one  sample  for  another  is 
described  in  derail  on  pages  345-50  of  this  chapter,  and  the  results 
of  the  substitution  are  evaluated  in  Chapter  10. 

The  plaintiffs  in  th  : court  suits,  the  respondents  returning  mail 
questionnaires  for  whom  additional  data  were  t:sired,  and  a 
sample  of  those  who  had  not  responded  to  the  mail  questionnaire 
were  combined  into  one  personal-interview  sample.  Personal  in- 
( terviews  were  completed  with  <106  of  the  564  designated  respond- 

I ents;  28  of  the  406  were  victims  of  nonserious  accidents  who  had 

not'rcturned  the  mail  questionnaire. 

The  combined  data  provided  bp  the  questionnaires  outlined 
above  were  considered  sufftciem  to  meet  the  purposes  originally 
outlined  for  the  study.  However,  in  the  course  of  processing  the 
interviews,  it  was  discovered  that  respondents  were  often  unable  to 
provide  certain  types  of  information  asked  for  on  the  question- 
naires. For  example,  in  cases  where  insurance  companies  had  paid 
hospital  and  medical  bills  directly,  the  injured  individual  or  his 
{ family  were  frequently  unaware  of  the  amount  paid.  In  other  cases, 
the  individual  being  asked  about  bad  been  killed  in  the  accident  or 
. had  died  since  the  accident,  and  the  respondent  was  some  other 

member  of  the  family.  Such  resj>ondents  possessed  varying  degrees 
of  knowledge  concerning  the  facts  of  the  accident;  in  some  cases 
they  knew  nothing  about  injury  costs  incurred  and  compensation 
received. 

After  careful  consideration  of  apparent  gaps  in  the  data,  it  was 
deemed  desirable  to  secure  certain  types  of  information — -particu- 
larly financial  data — from  ether  participants  in  the  compensation 
process.  Such  data  would  provide  missing  information  in  some 
cases  and  verification  of  respondents’  leper ts  in  other  cases.  Ac- 
cordingly, a second  research  grant  was  secured  for  these  purposes. 
The  addition •>.}  field  studies  undertaken  are  outlines!  briefly  below 
in  chronological  order. 

The  first  study  included  personal  interviews  with  two  groups  of 
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claimants’  lawyers.  The  first  group  included  all  lawyers  shown  on 
the  sampled  court  records  as  representing  plaintiffs,  whether  or 
not  a personal  interview  had  been  obtained  with  the  plaintiff.  The 
second  group  included  lawyers  hir'.d  by  those  individuals  sampled 
from  police  records  who  had  both  (1)  completed  a personal 
interview  and  (2)  granted  permission  for  their  lawyer  to  be 
interviewed.  There  were  63  lawyers  in  this  second  group.  Claim* 
ants’  lawyers  were  questioned  about  legal  proceedings  and  strategy, 
as  well  as  about  costs  and  compensation  to  both  themselves  and 
their  clients.  A few  of  the  claimants’  lawyers  answered  more  than 
one  questionnaire,  because  they  represented  mote  than  one 
sampled  plaintiff. 

. A second  set  of  questionnaires  was  completed  by  telephone  with 
the  individual  defendants  shown  on  each  sampled  court  calendar. 
The  putjosc  of  these  interviews  was  not  to  determine  the  total 
effect  of  the  accident  on  the  defendant  and  his  family,  but  only  to 
examine  the  process  by  which  the  defendant  undertook  to  defend 
himself  and  the  direct  consequences  of  the  process  itself.  Each 
defendant  was  asked  about  hi;  involvement  in  legal  proceedings 
(including  the  manner  in  which  he  secured  counsel)  and  about 
cny  psychic  or  economic  losses  incurred  by  himself  or  his  family  as 
a result  of  die  suit. 

A third  study  involved  mailing  a two-page  questionnaire  to 
each  lawyer  listed  on  the  sampled  court  records  as  representing  a 
defendant.  Questions  rskeJ  wete  parallel  to  questions  already 
asked  of  plaintiffs’  lawyers,  but  much  fewer  in  number.  They 
were  primarily  concerned  with  the  major  issues  or  sources  of  dis- 
agreement in  the  case  and  the  important  factors  underlying  de- 
termination of  a final  settlement.  As  among  the  plaintiffs,  but  to  a 
much  greater  degree,  some  lawyers  or  law  firms  represented 
several  of  the  defendants  in  the  sample. 

A final  set  of  mail  quc.tionnaires  veas  sent  to  all  hospitals, 
clinics,  or  other  medical  institutions  named  by  personal-interview 
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respondents  as  having  treated  or  cared  for  the  injured  or  deceased 
individuals.  Hospitals  were  asked  to  complete  a separate  one-page 
questionnaire  for  each  discharge  subsequent  to  the  date  of  the 
accident.  Questionnaires  asked  about  the  total  hospital  bill  for  each 
visit,  how  much  of  the  bill  had  been  paid,  and  who  paid  it.  These 
data  have  been  used  mainly  to  examine  reporting  bias. 

Figure  9-1  summarizes  the  above  overview.  In  the  immediately 
following  sections  of  this  chapter,  sample  design,  questionnaire 
content,  and  field  results  for  each  part  of  the  study  will  be  dis- 
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cussed  in  mote  detail,  followed  by  a description  of  the  procedures 
used  to  combine  the  various  parts  of  the  study  for  analysis  pur- 
poses. 

C.  The  Police  Sample 
1.  Sample  Design  Considerations 

As  previously  indicated,  the  universe  for  the  total  study  included 
all  individuals  injured  or  killed  in  automobile  accidents  that  oc- 
curred in  the  State  of  Michigan  during  one  calendar  year.  Before 
describing  the  study  design  actually  used,  a number  of  the  more 
important  problems  encountered  in  making  this  definition  opera- 
tional will  be  outlined. 

An  important  initial  problem  was  that  of  locating  an  apptopri* 
etc  sample  source.  The  "ideal”  sample  source — a complete  listing 
of  Michigan  personal-injury  auto  victims — did  not  exist;  however, 
three  potential  sampling  sources  were  available.  These  were  insur- 
ance company  records,  hospital  reports,  and  police  accident  reports. 

Insurance  company  records,  however,  do  not  include  persons 
injured  in  an  accident  when  the  parties  involved  are  not  insured, 
nor  do  they  include  those  injured  in  accidents  not  repotted  to  the 
insurance  company.  And,  in  addition,  since  individuals  involved  in 
Michigan  accidents  ate  insured  by  many  different  companies  (in- 
cluding some  not  having  an  office  in  Michigan),  it  would  be  diffi- 
cult to  select  a sample  representative  of  the  universe  as  defined  for 
this  study. 

Michigan  hospital  tccords  also  were  found  to  have  a number  of 
serious  deficiencies  for  purposes  of  sampling  the  defined  universe. 
First,  they  list  only  individuals  treated  in  a Michigan  hospital; 
thus  they  exclude  injured  individuals  going  to  a doctor's  office  or 
relying  on  self-treatment.  (The  Michigan  survey  shows  that  15 
pctccnt  of  those  listed  as  injured  but  not  killed  on  police  reports 
stated  that  they  had  not  been  treated  by  a doctor  as  a result  of  the 
accident.)  Second,  the  hospital  reports  would  have  to  be  sub- 
sampled ftom  a representative  sample  of  Michigan  hospitals.  This 
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would  involve  negotiating  for  access  to  files  with  each  sampled 
hospital  on  an  individual  basis.  Policy  regarding  admission  to  files 
varies  from  hospital  to  hospital,  as  do  the  administrative  pro- 
cedures. The  sampling  procedures  within  any  one  hospital  would 
have  to  be  individually  tailored  to  the  recotd-system  used. 

Another  crucial  sampling  problem  within  each  hospital  would 
have  been  that  of  identifying  injuries  resulting  directly  from  auto- 
mobile accidents.  Such  identification  would  have  been  based  on  the 
hospitals  admitting  diagnosis  wlvch  might  or  might  not  make  a 
direct  reference  to  the  automobile  accident  causing  the  injury.  (It 
might  be  expected  that  lack  of  direct  reference  would  be  patticu- 
latly  prevalent  in  cases  where  the  injury  did  not  manifest  itself  for 
some  time  after  the  accident,  or  in  cases  where  numerous  hospitali- 
2a t ions  were  required  for  recovery.)  It  is  conceivable,  if  not  in- 
evitable, that  seriouserror  would  result  from  the  actual  mechanics 
of  reading  and  classifying  entries  on  the  admission  papers.  And 
finally,  in  terms  of  the  sampling  the  defined  univer**,  error  would 
be  introduced  by  the  fact  that  an  individual  injured  a Michigan 
accident  could  be  hospitalized  in  another  state,  and  an  individual 
injured  in  another  state  could  be  hospitalized  in  Michigan. 

For  purposes  of  drawing  a representative  sample  of  individuals 
injured  in  personal -injury  automobile  aecidcnts,  police  accident 
reports  were  considered  deficient  in  only  two  minor  respects.  First, 
not  all  personal-injury  accidents  ate  repotted  to  the  police.  Such 
non-teporting  often  occurs  either  because  no  police  official  is 
immediately  available,  ot  because  the  patties  involved  mutually 
agree  to  settle  their  differences,  often  to  avoid  a ticket.  Sctond, 
many  individuals  do  not  become  aware  that  tb''y  have  been  in- 
jured until  after  the  police  report  has  been  completed  and  all 
parlies  have  left  the  scene  of  the  accident.  This  is  often  the  ease 
with  minor  back  or  internal  injuries.  Even  though  many  of  these 
«codcnts  are  repotted  to  the  police  (as  involving  property  dam- 
age), the  individuals  do  not  appear  cn  the  police  reports  as  having 
been  injured. 
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An  evaluation  of  the  above  and  other  considerations  indicated 
that  the  best  available  sampling  source  would  be  police  accident 
reports.  The  major  reasons  for  this  decision  are  as  follows.  First, 
the  actual  selection  of  the  sample  could  be  accomplished  by  se- 
curing access  to  only  two  sets  of  records.  Personal-injury  and  fatal 
accident  reports  for  the  City  of  Detroit  are  filed  at  Detroit  Police 
Headquarters.  Reports  for  the  rest  of  the  state  are  filed  at  State 
Police  Headquarters  in  East  Lansing.  Second,  the  police  reports 
offered  the  only  sample  source  consistent  with  the  defined  universe. 
Third,  the  police  accident  reports  ate,  by  law,  a matter  of  public 
record.  (Accompanying  documents,  such  as  signed  statements  or 
police  investigation  reports,  ate  not  available  to  the  public.)  A 
o'iscussion  of  the  possible  biases  present  in  such  a sample  will  be 
found  in  Chapter  10;  it  should  be  indicated  here,  however,  that 
the  biases  have  been  evaluated  as  relatively  small. 

■ Before  the  sampling  of  police  reports  could  be  begun  in  Detroit 
and  East  Lansing,  there  remained  the  problem  of  which  particular 
calendar  year  (o;  years)  to  study.  Preliminary  studies  had  indi- 
cated that  a high  percentage  of  all  Michigan  personal-injury 
automobile  accidents  reported  to  the  police  could  not  be  considered 
"scions”  when  measured  by  any  yardstick  (e.g.,  number  of  days 
in  the  hospital,  time  lost  from  work,  property  damage,  compensa- 
tion received,  etc.).  The  large  majority  of  all  personal-injury 
accidents  appeared  to  be  settled  in  a relatively  few  months  after 
the  accident.  For  "minor”  accidents,  it  was  felt  that  a long  time 
span  between  ti  e date  of  the  accident  and  the  date  of  inte'viewing 
would  invite  the  possibility  of  serious  error  from  memory  distor- 
tion; it  seemed  likely  that  respondents  would  not  be  able  to  repott 
accurately  an  event  which  had  occurcd  a number  of  years  in  the 
past,  and  which  was  not  important  to  them.  Indeed,  there  also 
would  be  serious  problems  in  locating  individuals  with  very  old 
address  records. 

The  preliminary  studies  showed  that  a large  fraction  of  the 
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economic  loss  incurred  in  personal-injury  automobile  accidents 
resulted  from  the  relatively  small  number  of  ''serious"  accidents, 
many  of  which  (particularly  those  going  to  court)  required  up  co 
six  or  seven  years  to  settle.  For  the  purposes  of  this  research,  it 
was  essential  that  most  of  the  cases  selected  be  settled  at  the  time 
of  the  interview. 

2.  Final  Sample  Design 

The  final  sample  of  police  personal-injury  reports  is  a statewide 
representative  sample  of  accidents  that  took  place  during  1958. 
Ihe  actual  sampling  of  police  files  resulted  in  the  selection  of 
11)8  accidents,  wnich  included  2S72  individuals  eligible  for  the 
study. 

The  basic  unit  of  analysis  around  which  the  final  research  de- 
sign was  constructed  is  any  individual  listed  on  the  sampled  ac- 
cident reports  as  having  been  injured  or  killed  in  an  automobile 

accident  that  took  place  in  Michigan  during  the  1958  calendar 
year. 

The  sample  of  police  reports  was  drawn  from  two  sources.  The 
Detroit  Police  Department  provided  records  for  nonfatal  per- 
sonal-injury  accidents  that  took  place  within  the  city  limits,  and 
the  Safety  and  Traffic  Bureau  of  the  Michigan  State  Police  pro- 
vided records  for  nonfatal  accidents  taking  place  outside  of 
Detroit  and  for  all  fatal  accidents  in  the  state.  Monfatal  personal- 
injury  accidents  were  selected  using  a sampling  rate  of  one  in 
forty-two;  fatal  accidents  wete  sampled  at  a rate  of  one  in  six. 

lhese  police  repotts  constitute  the  basic  sample  frame  for  this 
Study,  However,  as  indicated  previously,  one  subset  of  eligible  re- 
spondents listed  on  these  reports  was  dropped  for  purposes  of 
inatysis,  and  a substitution  was  made  using  a sample  selected 
from  the  Michigan  courts  The  court  sample  is  discussed  in  detail 
in  ft  subsequent  section  of  this  chapter. 
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D.  Mail  and  Telephone  Screening  Interviews 

1.  Purpose 

Only  a few  personal-injury  automobile  accidents  are  serious. 
For  most  minor  accidents,  answers  to  a few  factual  questions  will 
provide  a relatively  complete  picture  of  the  accident  and  its  re- 
lated costs.  To  avoid  the  high  cost  of  conducting  personal  inter- 
views with  a large  number  of  accident  victims  who  had  only  minor 
injuries  and  who  received  little  or  no  reparation,  a screening 
questionnaire  was  mailed  to  all  individuals  sampled  from  police 
records.  The  questionnaire  was  designed  to  determine  basic  fac- 
tual data  concerning  the  extent  of  both  the  injury  and  the  eco- 
nomic loss  resulting  from  the  accident.  Individuals  who  did  not 
return  the  first  form  were  sent  a second  mail  questionnaire,  and 
those  who  still  did  nor  respond  were  telephoned  by  members  of 
the  Survey  Research  Cen'-r’s  field  staff.  For  an  analysis  of  the  re- 
lability  of  the  screening  questio,-  nairc,  see  Chapter  10. 

2.  Quettionnnire  Content  <wJ  Eligible  Respondents 

The  mail  seteening  questionnaire  accompanied  by  a cover  letter 
was  sent  to  each  eligible  respondent  believed  to  be  living  at  the 
time  of  the  mailing,  and  to  the  next-of-kin  of  individuals  listed  as 
fatalities  on  the  police  reports.  In  general,  questionnaires  asking 
about  deceased  individuals  wete  sent  to  the  "Informant"  listed  on 
the  Death  Certificate.  (In  Michigan,  a photostat  of  the  De-th  Cer- 
tificate for  each  listed  fatality  is  filed  with  the  police  accident  re- 
port.) An  accompanying  letter  explained  the  purpose  of  both  the 
study  and  of  selected  questions  on  the  o’cstionnaire.  It  guaranteed 
the  respondent  anonymity  and  it  instructed  the  respondent  not  to 
answer  certain  questions  if  a lawyer  was  still  working  to  help 
collect  money  in  connection  with  the  accident.  The  questionnaire 
itself  asked  about  medical  treatment  and  medical  expense,  legal 
aid  received  and  legal  expense,  the  number  of  days  lost  from 
work,  valuation  of  property  damage,  whether  Oi  rot  any  perman- 
ent disability  had  been  incurred,  and  the  sources  and  amounts  of 
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reparation  received.  In  the  case  of  fatalities,  additional  questions 
were  asked  about  the  age,  education,  and  income  of  the  deceased 
individual  at  the  time  of  the  accident.1 

The  content  of  the  telephone  questionnaire  was  identical  to  that 
of  the  mail  questionnaire;  only  the  questionnaire  format  was  al- 
tered to  facilitate  asking  questions  and  recording  answers  by 
phone.  It  the  injured  individual  was  under  sixteen  years  of  age 
at  the  time  of  the  interview,  the  interviewer  was  instructed  to 
interview  a parent  or  any  other  responsible  adult  with  knowledge 
of  the  accident. 

3.  Results 

The  results  of  the  initial  screening  are  shown  in  Table  $>-l»  Of 
the  SIS  individuals  by  whom  no  questionnaire  was  completed 
6$  percent  were  never  located,  despite  an  extensive  search  of 
telephone  books  and  city  directories. 

The  screening  questionnaire  was  not  designed  to  provide  com- 
plete information  about  seriously  injured  individuals.  Further  in- 
formation was  to  be  secured  from  them  by  pcrson.it  interview. 

TABLE  9-1 

Sample  Site  and  Response  Rate  fci  Mali 
and  T tit  phone  Screening  Questionnaire 


Number  of 

e/neilionnatres 


NttKibir  of  q.'itjthnrijirt!  <omf>let«{ 

XU4 

fey  mail 

1287 

By  telephone 

161 

iVmkr  cf  nol  ccmpkuJ 

818 

Total  sample 

2872 

Percent  of  questionnaire*  completed 

71>« 

* ftobibin  tbf  focMioft  of  fnost  of  tbs  ictuil  <*o*\jtnsM«  in  tb!i 
rwMith.  Jtormr.  topic*  of  the  wmi  linen,  q.cisiooniirti,  imtrvitvtri*  inv/vc- 
lion*,  oJitinj  ifhl  toJti  for  tt'h  pirt  of  tb«  »h>Jf  hi*t  bttn  micto- 

CfmoJ,  irvJ  topit*  of  ibt  film  t»n  bo  obtiietj  from  ibt  Lbn.-’in,  Instlcvte  for 
Social  lUisirch,  Tbt  Univtni’./  of  M;cbi^io,  At.ft  Allot,  Mkiiifio.  (Ktftrcot*: 
Supple /new  A,  Aotor.v>bi’(  Seudf.) 
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For  purposes  of  this  study,  a "serious”  injury  was  defined  as  one 
which  resulted  in  death,  or  resulted  in  medical  bills  of  $500  or 
more,  or  required  hospitalisation  of  three  weeks  or  more,  or  re- 
sulted in  some  permanent  impairment  of  ability  to  work.  Two 
hundred  ninety-four  of  the  icspondcnts  who  completed  a screening 
questionnaire  were  classified  as  seriously  injured  and  subsequently 
were  designated  for  personal  interviews.  For  the  ''minor”  injuries, 
the  data  provided  by  completed  mail  or  telephone  questionnaires 
were  considered  to  be  sufficient  for  analysis.  Additional  personal- 
interview  respondents  were  forthcoming  from  the  Michigan  court 
sample. 

E.  The  Michigan  Court  Sample 
1.  Sample  Design 

Because  of  the  recentness  of  the  accidents,  the  police  sample  in- 
cluded a group  of  individuals  who  v/ere  stilt  involved  in  some 
form  of  legal  action  at  the  time  of  the  initial  mail  and  telephone 
interviewing.  These  individuals  might  be  expected  to  be  either 
reluctant  or  unable  to  give  complete  cost  and  compensation  data 
concerning  the  accident  until  legal  actions  were  concluded. 
Rather  than  wait  for  completion  of  these  proceedings,  it  was  de- 
cided to  draw  a second  sample  of  older  personal-injury  automobile 
accidents  which  involved  court  action.  How  was  this  substitute 
sample  selected? 

Initially,  it  was  established  that  calendar  year  1957  was  likely 
to  be  the  most  recent  year  for  which  most  cases  commenced  in 
that  year  could  be  traced  to  their  final  disposition.  Therefore  the 
universe  for  the  court  sample  was  defined  as  all  personal-injury 
suits  filed  during  1957  on  the  calendar  of  either  a Michigan  Cir- 
cuit Court,  the  Kent  County  Superior  Court,  or  a Federal  District 
Court  located  in  Michigan. 

In  order  to  sample  this  universe,  a number  of  important  prac- 
tical problems  had  to  be  overcome.  First,  there  are  no  statistics 
available  in  Michigan  which  describe  the  universe  as  defined. 
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Statistics  describing  the  business  of  the  Michigan  Circuit  Courts 
are  collected  and  published  by  the  Administrator  of  State  Courts. 
These  statistics  are  broken  down  only  among  law,  chancery,  and 
criminal  calendars.  There  are  no  figures  available  which  describe 
the  composition  of  civil  litigation  by  type  of  case.  In  order  to 
determine  the  proper  sampling  fractions  for  the  Michigan  court 
sample,  a separate  study,  the  Michigan  Court  Study,*  was  under- 
taken to  examine  the  law  and  chancery  calendars  of  Michigan 
Circuit  Courts. 

To  insure  efficient  use  of  available  funds,  time,  and  research 
personnel,  stratified  probability  sampling  techniques  were  used 
(1)  to  select  twenty-three  counties  in  the  state  and  (2)  to  sub- 
select 2411  cases  from  within  these  counties  in  such  a manner 
that  the  cases  include"'  in  the  sample  would  be  an  unbiased  repre- 
sentative sample  of  all  cases  filed  in  the  state.  Independent 
samples  were  drawn  from  the  law  and  chancery  calendars.  In 
order  that  the  final  sample  include  about  the  same  number  of 
’cases  from  each  calendar,  it  was  decided  to  sample  one  out  of  each 
r":ccen  cases  filed  on  civil  law  calendars  and  one  out  of  each 
t . ity-eight  cases  filed  on  chancery  calendars.  Data  from  the 
court  calendars  were  transcribed  during  February  and  March  ut 
I960. 

The  final  court  sample  included  1226  cases  from  the  chancery- 
calendar  and  1 ’ S5  cases  from  the  civil  law  calendar.  Of  the  civil 
law  cases,  256  were  filed  to  recover  damages  resulting  from  a 
personal-injury  automobile  accident.  Since  the  court  action  for 
all  but  11  of  these  cases  had  been  completed  at  the  time  the 
sample  W3S  drawn,  and  since  256  cases  were  more  than  enough 
for  the  desired  auto  personal-injury  sample,  these  cases  were  desig- 
nated as  the  substitute  sample. 

To  insure  adequate  representation  of  cases  involving  out-of- 


s Fot  a brief  discussion  of  the  results  of  this  study,  see  Alfred  F.  Conard  and 
Charles  E.  Voltz,  ‘The  Economics  of  Injury  Litigation,”  Michigan  State  Bar  Journal 
(August  I9o0). 
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state  plaintiffs  or  the  federal  government,  an  additional  sample 
was  drawn  from  the  three  Federal  District  Court  Divisions  lo- 
cated in  Michigan,  The  calendars  of  these  courts  indicate  the 
cause  for  the  complaint;  consequently,  the  sample  was  taken  from 
cases  indicated  on  the  calendar  as  personal-injury  automobile  acci- 
dent cases.  No  winnowing  of  other  kinds  of  cases  was  necessary. 
Selected  at  a rate  of  one  in  four  automobile  personal-injury  cases, 
the  final  sample  included  53  suits,  of  which  three  were  still  open 
at  the  time  the  sample  was  selected. 

Thus,  the  composite  court  sample  was  made  up  of  309  cases — 
256  from  the  Circuit  Courts  and  the  Kent  County  Superior  Court, 
and  53  from  the  Federal  District  Court  Divisions.  However,  the 
budget  would  not  allow  inclusion  of  all  these  cases  in  the  per- 
sonal-interview sample,  and  a subsampling  procedure  was  re- 
quired. The  cases  were  divided  into  two  groups.  The  first  included 
.cases  where  the  plaintiff  filed  for  damages  in  excess  of  $25,000. 
This  group  consisted  of  123  cases  and  was  included  in  the  per- 
sonal-interview sample  with  certainty,  i.  e.,  all  123  were  desig- 
nared  for  personal  interviews.  The  second  group  included  all  cases 
in  which  the  plaintiff  filed  for  damages  of  $25,000  or  less.  The 
172  cases  in  this  group  were  subsampled  at  a rate  of  one  in  two, 
with  every  other  case  (N  = 84)  being  included  in  the  personal- 
interview  sample.  The  14  “still  open"  cases  were  excluded  from 
the  personal-interview  subsample,  since  it  would  not  be  proper  to 
interview  the  persons  concerned  before  a final  settlement  had  been 
reached.  The  14  cases  are  not  represented  in  the  analysis.  The 
final  personal-interview  sample  included  207  court  cases.  Table 
9-2  summarizes  the  composition  of  the  court  sample.  To  the  ex- 
tent that  t)ie  14  “still  open”  cases  are  atypical,  a bias  is  introduced. 
A follow-up  could  be  completed  on  these  cases  at  some  future 
date. 

The  reader  should  be  aware  of  one  important  methodological 
problem  associated  vdth  combining  data  from  the  police  and 
court  samples.  Michigan  police  reports  include  only  accidents 
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that  occur  within  the  geographic  limits  of  Michigan.  However, 
some  Michigan  accidents  involve  vehicles  from  other  states;  and 
it  is  possible  that  a Michigan  resident  would  choose  to  sue  3n 
out-of  state  motorist  in  the  state  of  the  latter's  residence.  On  the 
other  hand,  any  sample  of  automobile  accident  cases  filed  in 
Michigan  courts  will  include  some  suits  resulting  from  accidents 
which  did  not  take  place  in  Michigan.  However,  in  this  study  both 
of  the  groups  described  are  small  relative  to  the  total  sample,  and 
it  is  highly  unlikely  that  any  extensive  biases  are  introduced  by 
making  the  assumption  that  court  cases  filed  in  Michigan  resulting 
from  accidents  occurring  outside  the  state  are  equivalent  to  out-of- 
state  court  actions  arising  from  Michigan  accidents.  For  purposes 
of  the  analysis  herein,  this  assumption  has  been  made. 

The  procedures  used  to  substitute  plaintiffs  from  the  court 
sample  for  plaintiffs  from  the  police  sample  are  described  later 
in  this  chapter,  and  the  effectiveness  of  the  procedures  is  evaluated 
in  Chapter  10. 

F.  Personal  Interviews  with  Plaintiffs  and  Personal 
Interviews  with  Mail  and  Telephone  Respondents 
Reporting  "Serious”  Injuries 

1.  Purpose 

Personal-interview  questionnaires  were  designed  to  secure  in- 
formation concerning  the  consequences  of  serious  or  complicated 
automobile  accidents,  such  as  those  involving  death,  serious  in- 
jury, extended  litigation,  and  large  income  losses. 

The  564  respondents  designated  for  personal  interviews  tvere 
selected  from  three  sources.  First,  there  were  292  "serious"  cases 
from  the  mail  and  telephone  screening  interviews.  Second,  there 
were  207  plaintiffs  from  personal-injury  automobile  suits  filed 
during  1957  on  calendars  of  Michigan  Circuit  Courts,  the  Kent 
Count)'  Superior  Court,  or  one  of  the  three  Federal  District  coutt 
E ivisions  located  in  Michigan.  And  third,  in  order  to  learn  some- 
thing about  the  characteristics  of  persons  who  did  not  return  a 
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mail  or  telephone  questionnaire,  a subsarnple  of  <56  cases  was 
selected  from  those  not  responding  to  the  screening  questionnaire. 
Since  extensive  effort  had  been  already  made  to  complete  mail 
or  telephone  questionnaires  with  this  group,  it  was  evident  that 
additional  efforts  to  secure  these  interviews  would  represent  a 
relatively  inefficient  use  of  resources;  consequently,  the  sample 
selected  was  intentionally  small. 

2.  Questionnaire  Content  and  Eligible  Respondents 

In  order  to  interview  all  eligible  respondents,  two  different 
personal-interview  schedules  were  requited.  An  "A"  questionnaire 
was  designed  to  be  used  whenever  the  injured  person,  himself, 
was  being  interviewed;  and  a "B”  questionnaire  was  designed  for 
situations  where  someone  other  than  the  injured  person  was  being 

interviewed,  e.  g.,  when  the  injured  person  was  a minor  child  or 
deceased.3 

For  cases  sampled  from  court  calendars,  interviewers  were  in- 
structed to  talk  only  with  the  plaintiff.  No  substitutions  were 
allowed  here.  (Of  course,  the  plaintiff  may  or  may  not  have  been 
the  injured  person.) 

tor  cases  sampled  from  police  records,  interviewers  were  in- 
structed to  talk  only  with  the  injured  person,  with  the  following 
exceptions.  If  the  injured  person  had  died  prior  to  the  interview 
(regaidless  of  the  cause  of  death),  the  interviewer  was  allowed 
to  interview  an  adult  member  of  the  household  at  the  time  of 
the  accident  who  had  a reasonable  knowledge  of 'the  facts.  Sub- 
stitution was  also  allowed  if  the  injured  person  was  twenty-one 
years  old  or  younger  at  the  time  of  the  accident.  Eligible  respond- 
ents for  these  cases  were  classified  into  two  groups.  If  the  person 
injured  was  sixteen  years  old  or  younger  at  the  time  of  the  acci- 
dent, the  interview  was  obtained  with  either  a parent  or  guardian. 
No  other  substitution  was  allowed  for  these  cases.  However,  if 
the  injured  person  was  between  seventeen  and  twenty-one  years 
3 

Footnote  omitted. 
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old  (inclusive)  at  the  time  of  the  accident,  the  eligible  respondent 
could  be  either  the  injured  person  or  a parent  or  guardian,  or  the 
injured  person’s  spouse. 

In  cases  where  individuals  other  than  the  injured  person  qual- 
ified as  eligible  respondents,  every  effort  was  made  to  complete 
the  interview  with  the  eligible  respondent  most  knowledgeable 
about  the  facts  of  the  accident 

3.  Results 

A summary  of  sample  sizes  and  response  rates  for  each  of  the 
personal-interview  groups  is  presented  in  Table  9-3. 

As  shown  in  Table  9-3,  personal  interviews  were  completed 
with  33  of  the  66  individuals  in  the  nonrespoi.se  subsample.  Two 
alternatives  were  available  for  using  these  completed  interviews  in 
analysis.  First,  they  could  be  used  to  represent  the  entire  mail  and 
telephone  nonresponse  (that  is,  the  181  cases  which  did  not  re- 
spond to  the  screening  questionnaire).  Second,  they  could  be 
used  as  a basis  for  completing  a mail  or  telephone  question- 
naire for  each  respondent,  which,  in  turn,  would  thei.  be  used  in 
the  same  manner  as  the  original  mail  and  telephone  response, 
that  is,  to  determine  if  the  injury  was  "serious”  enough  to  qualify 
the  respondent  fo:  a personal  interview.  The  small  subsample 
and  the  large  weights  that  would  be  inv<~’  red  in  the  first  alter- 
native suggested  that  a serious  bias  might  be  introduced;  conse- 
quently, the  second  alternative  was  chosen.  Of  the  33  completed 
interviews,  5 were  "serious”  and  were  included  with  the  other 
"serious’’  personal  interviews  for  weighting  and  analysis.  The  re- 
maining 28  interviews  were  not  "serious”  and  have  not  been  used 
in  the  detailed  analysis  of  personal  interviews;  but  they  have  been 
used  in  conjunction  with  the  mail  and  telephone  returns  for  the 
analysis  of  all  cases,  thereby  improving  the  response  rate.  The 
"adjusted”  personal-interview  sample,  i.e.,  the  one  used  for  anal- 
ysis, is  presented  in  Table  9-4. 
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G.  Computation  of  Weights  for  Police  and  Court 
Records,  and  Mail,  Telephone,  and  Personal  Interviews 

Weighting  is  necessary  when  the  cases  or  individuals  in  the 
universe  have  different  chances  of  being  drawn  into  the  sample.  In 
this  study,  for  example,  the  sample  of  police  records  consists  of 
every  forty-second  nonfatal  accident  record  and  every  sixth  fatal 
accident  record.  This  means  that  every  nonfatal  accident  selected 
represents  itself  and  41  other  nonfatal  accidents  and  every  fatal 
accident  represents  itself  and  five  other  fatal  accidents.  To  esti- 
mate a statistic  for  the  state  as  a whole,  data  from  each  non-, 
fatal  accident  record  must  be  multiplied  by  forty-two  and  data 
from  each  fatal  accident  record  must  be  multiplied  by  six.  (See 
column  2,  Table  9-5.) 

A second  weighting  step,  weighting  for  nonresponse,  may  be 
demonstrated  by  referring  to  columns  4 and  5 of  Table  9-5.  Using 
the  first  row  as  3n  example,  note  first  that  every  forty-second  non- 
fatal accident  case  was  sampled.  An  attempt  was  then  made  to 
complete  a mail  or  telephone  interview  with  each  of  the  1462 
individuals  injured  in  these  accidents.  If  interviews  bad  been 
completed  with  all  1462  people,  the  data  on  each  completed 
interview  would  have  a weight  of  forty-two.  However,  only  1075 
respondents  (73-5  percent)  actually  completed  and  returned  a 
' ucstronnaire.  These  questionnaires  were  then  "weighted  up”  to 
represent  all  individuals  injured  in  Michigan  nonfatal  accidents 
during  1958.  Thus,  the  original  sample  contained  1462  individ- 
uals, each  representing  himself  and  forty-one  other  injured  per- 
sons. After  the  mail  and  telephone  survey,  the  sarnie  number  of 
injured  persons  (42  x 1462)  was  represented  by  1075  com- 
pleted interviews — each  respondent  representing  himself  and  57 
(42  x 1462  -f-  1075)  other  persons  injured  in  nonfatal  accidents. 
The  use  of  identical  weighting  procedures  in  all  subgroups  would 
implicitly  assume  that  all  nonresponse  cases  were  alike  and  equal 
to  the  average  of  the  responses.  Such  a procedure  would  be  valid 
if  the  nonrespondents  were  similar  to  the  respondents  in  regard  to 
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variables  important  for  the  study.  To  the  extent  that  the  two 
groups  differ,  potential  bias  exists,  and  weighting  within  sub- 
groups ran  be  expected  to  reduce  it.  Evidences  of  bias  in  this  study 
are  discussed  in  a later  section  of  this  book.  (See  Chapter  10.) 

The  same  reasoning  may  be  applied  to  the  remaining  sttvta 
in  Table  9-5  as  well  as  to  the  additional  tables  in  this  chapter, 
v/hich  show  the  response  rates  and  weights  for  the  remaining  parts 
of  the  study. 

If.  Substitution  of  Personal  Interviews  Completed 
with  Plaintiffs  from  the  Court  Sample  for  Plain  tiffs 
from  the  Police  Sample 

1.  Procedure  for  Designating  Plaintiffs  from  the  Police  Sample 

In  order  to  substitute  the  personal-interview  court  sample  for 
plaintiffs  from  the  mail-telephone  study  (Sec  Figure  9*1,  supra), 
it  was  essential  to  know  which  of  the  2054  mail  or  telephone 
respondents  had  actually  filed  a suit.  Preliminary  drafts  of  the 
questionnaire  had  asked  whether  a suit  had  been  filed,  but  exper- 
ience gained  in  pretests  showed  that  respondents  frequently  did 
not  know  whether  a suit  had  been  filed  or  not.  All  they  knew  was 
that  they  had  put  the  matter  in  the  hands  of  "my  lawyer,"  or  "my 
insurance  company.”  The  project  staff  therefore  decided  to  ask  the 
named  lawyer  or  insurance  company  for  suit  information.4 

It  is  recognized  that  an  insurance  company  cannot  properly 
represent  a policyholder’s  personal-injury  claim.  However,  it  is 
probable  that  the  counsel  who  represents  the  liability  insurer  of 
the  injured  person,  or  of  the  owner  of  the  car  in  which  the  in- 
jured person  was  driving  would  know  about  any  suits  filed  in  con- 
nection with  the  accident.  The  injured  person  frequently  does  not 
perceive  the  counsel  and  the  insurance  company  as  two  different 
organizations.  This  would  be  especially  likely  if  two  car  owners 

4 In  view  of  the  difficulties  encountered  in  using  this  procedure,  it  may  veil  be 
that  in  future  studies  the  individuals  should  be  asked  directly  whether  a suit  had 
been  filcJ*  and  additional  information  sought  only  in  cues  where  the  individual 
cannot  provide  the  necessary  data. 
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are  making  claims  against  each  other.  Because  of  these  consider- 
ations, the  project  staff  decided  to  make  inquiry  of  the  insurance 
company  named  as  "handling"  the  claim,  rather  than  explaining 
to  the  injured  petson  that  he  cannot  be  represented  by  an  insur- 
ance company  and  seeking  to  get  from  him  the  name  of  the  law- 
yer involved. 

Before  the  lawyers  or  insurance  companies  were  approached,  the 
2054  completed  mail  and  telephone  questionnahes  were  reviewed 
to  determine  which  respondents  could  be  reasonably  classified 
as  potential  plaintiffs.  The  completed  schedules  clearly  indi- 
cated that  1330  respondents  had  not  been  plaintiffs.  The  large 
majority  of  this  group  were  eliminated  because  they  answered 
"No”  .hen  asked,  "Did  you  put  your  case  (was  the  case  put)  in 
the  hands  of  a lawyer  or  insurance  company?”  The  problem,  then, 
was  to  determine  which  of  the  remaining  724  respondents  had 
filed  as  plaintiffs  in  a Michigan  Circuit  or  Federal  District  Court. 

If  the  name  of  the  lawyer  or  insurance  company  had  been  pro- 
vided by  the  respondent,  a form  letter  accompanied  by  a return 
postcard  (shown  on  the  next  page)  was  mailed  to  the  designated 
party.  The  letter  described  the  study’s  purpose,  guaranteed  the 
respondent  anonymity,  and  explained  both  why  it  was  necessary 
to  know  the  information  requested  on  the  postcard  and  why  it 
was  felt  that  more  accurate  answer*  would  be  obtained  from 
lawyers  and/or  insurance  companies  than  from  the  sampled  in- 
dividuals. 

Postcards  returned  by  lawyers  or  insurance  companies  were 
sorted  into  three  groups.  First,  if  the  completed  postcard  stated 
that  the  individual  inquired  about  had  not  been  a party  to  liti- 
gation of  any  kind,  or  had  beet,  a party  to  a suit  brought  in  a 
lower  court  (but  had  not  been  involved  in  litigation  brought  in 
either  a Circuit  or  Federal  District  Court),  or  had  been  a <fe- 
feiuftnt  in  a suit  filed  in  either  a Circuit  or  Federal  District  Court, 
these  cards  were  set  aside.  For  purposes  of  the  sample  substitution, 
it  was  considered  that  these  persons  were  not  plaintiffs  in  litiga- 
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(Pie? sc  cri;dc  the  correct  answer)  SRC  No. 

1.  Was  the  individual  (named  on 
accompanying  letter)  either  a 
plaintiff  or  defendant  in  a 
suit  filed  on  a Michigan  court 

calendar?  Yes  No 

IF  YES  2.  Plaintiff  or  defendant?  P D 

TO  {1 

3.  In  what  court  was  the 
suit  filed? 

d.  When?  Month  Year 

THANK  YOU 

. Postcard  mailed  to  lawyers  and/or  insurance  companies 
named  by  mail  or  telephone  respondents 

tion  brought  in  either  a Circuit  or  Federal  District  Court.  Second, 
if  die  coinplctcd  postcard  stated  that  the  individual  inputted  about 
was  a plaintiff  in  a Circuit  or  Federal  District  Court  case,  this  was 
considered  conclusive  and  the  individuals  represented  by  these 
cards  became  the  nucleus  of  the  police  sample  group  that  would 
be  dropped  in  favor  of  the  court  sample.  Third,  if  the  name  of  a 
lawyer  or  insurance  company  was  not  given  by  the  original  re- 
spondent, or  if  the  card  was  not  returned  by  the  lawyer  or  insur- 
ance company,  or  if  the  lawyer  or  insurance  company  could  not 
provide  the  information  requested,  a form  letter  and  postcard 
(similar  to  the  card  sent  to  lawyers)  was  sent  to  the  original  re- 
spondent. The  cover  letter  thanked  the  individual  for  having  pre- 
viously completed  cither  the  mail  or  telephone  questionnaire  and 
explained  that  the  additional  information  was  needed  in  order  to 
develop  a complete  statistical  picture  of  the  results  of  automobile 
accidents  in  Michigan.  The  postcards  returned  by  .Ms  group  were 
treated  in  the  same  manner  as  those  returned  by  the  lawyers  or 
insurance  companies. 

The  results  of  the  mailing  (at  a cut-off  date  two  weeks  after  the 
last  letter  had  been  mailed)  are  shown  in  Table  9-8.  The  fart 
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that  the  numbet  of  plaintiffs  exceeds  the  number  of  defendants 
might  be  explained  in  a number  of  ways.  First,  it  could  be  argued 
that  a defendant  who  is  also  the  plaintiff  in  a counterclaim  would 
be  more  likely  to  return  the  postcard  marked  "plaintiff/’  Second, 
for  accidents  involving  a number  of  individuals  or  vehicles,  there 
are  often  more  plaintiffs  than  defendants,  i.e.,  if  an  individual 
is  generally  considered  to  be  at  fault,  it  is  probable  that  he  will  be 
sued  by  a number  of  other  persons.  Third,  it  might  be  expected 
that  mail  or  personal  inquiries  would  present  less  of  a psycholog* 
ical  threat  to  plaintiffs  than  to  individuals  who  have  been  accused 
of  being  responsible  for  the  accident.  And  finally,  although  there 
may  be  a fifty-fifty  chance  that  an  injured  person  was  at  fault, 
there  is  a considerable  chance  that  he  did  no  damage  to  the  other 
party,  cither  because  collision  was  with  a fixed  object  (e.g.,  tree, 
ditch,  or  parked  car)  or  because  the  other  vehicle  was  less  Mil* 
ncrable  (eg.,  truck  or  railtoad).  Also  it  is  well  known  that  a 
seriously  injured  person  is  less  likely  to  be  sued  because  (1)  the 
jury  will  sympathize  v/ith  him,  or  (2)  he  is  impoverished  by  the 
accident  and  can’t  pay. 


TABLE  9-8 

Designation  of  Plaint'iQi  in  Police  Sample:  Remits  of  Mailing 
to  Lawyers,  Imuranee  Companies,  and  Initial  Respondent! 


RtJulss  o f Mailing 

\ 

H*nbtr 
of  easts 

Percent 

o / tampU 

Postcard  CHtilicfituirt  teas  tt/utned 

)70 

>1% 

Pl&tntitT  in  Circuit  ot  Federal 

Court 

58 

5 

Defendant  in  Circuit  ot  Federal 

District  Coutt 

20 

5 

Patty  to  t suit  brought  in  t lower 

coutt 

21 

5 

Not  a patty  to  litigation 

256 

55 

Lawyer  or  insurance  company  couW  not 

provide  the  information  requested 

54 

5 

Pcxt&rd  junlicnviite  utt  vet  teUtKtd 

355 

49 

Total 

724 

100% 
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.Jab,f  9‘8  Sj°WS  that  in  34  cases  ‘he  postcards  were  returr-ed 

semZ^To’rfre  “J "'f ^ 355  Cas”'  »*  were  no, 

«ked  atone  Vri?  ^ A‘lrCj  iubs',',ulion.  a«  3S9  individuals 
asked  about  0n  these  questionnaires  had  to  be  "assigned"  to  one 

of  the  , hire  following  caiegories:  (i)  plaintiff  ctn.it  ot  Fed. 

Court  o,Tlf°U"'  <2)  *fcndant  Qk“'<  or  Federal  District 

* !UI''  UU£hl  in  2 '°™  ““  « ~ * 

inform  a.ss!*nracms L"-c'c  »“*  by  first  examining  all  available 
nformat'on  on  each  rase,  and  where  it  seemed  highly  likely  ,h„ 

wtied  to  fhn0b^n  ' »"*  “ * MMW 

sJ  * J S.  d 1°  thc  thlfd  ca'eS°r>’-5  (Ten  individuals  were  as- 

avaidsWrem  °"  ,l,c  ba,is  of  i»f«"a.ion 

i , ,e  CntJfe  stuc^*  CfJfer^  were  selected  that  could 

be  used  associate  an  individual  with  one  of  the  three  «. 

S tics,  or  example,  all  individuals  arrested  or  cited  for  a viola. 

c"  ZTf  *?***  «W  on  the  basis  of  the 
fact  that  the  large  majority  of  defendants  who  did  return  a Dost. 

ard  questionnaire  had  been  a, tested  or  cited  for  a ZL  J2.  ' 

Id,'  “ I’1  ^ r,mtCt  wiaM«  «'0  used  to  assign 
each  case,  and  variables  that  were  best  able  to  discriminate  among 

the  three  categories  were  given  more  weigh,  in  classifying  2 

S »■«  continued  un.il  every  in- 

ditidual  had  been  included  in  one  of  the  three  categories.  Of  the 

samole  v‘V  ’d’  • *7°''“  !an,p,e  '°5'  or  37  £r«ne  of  the 
sample,  were  designated  as  plaintiffs  in  a Michigan  or  Federal 

Distiic,  Court  Thirty-eight  of  these  had  returned  JpLtca'd 

questionnaire.  Eleven  were  designated  plaintiffs  as  a mult  of 

m o mation  available  from  the  personal  i„,e, views,  andZst 

maining  5ff  were  assigned  using  the  ptocedu.e  just  ml, lined 

tiffs  ftomX  ol"  F,£“te?’1,  ,M  ana,,s“  purposes  the  105  plain- 
tiffs ftom  the  lol.ee  sample  were  dropped,  and  the  plaintiff,  from 

lluo  ii^wsemj  htre^"1  ,CpWI  describts  its  iiiisnrocm  protrdvre  to  rcore  <f(„n 
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the  court  sample  were  substituted  for  them.  Both  die  logic  under- 
lying this  type  of  sample  substitution  procedure  and  the  effective- 
ness of  the  substitution  in  this  particular  study  are  discussed  in 
Chapter  10.  * 

I.  Personal  Interviews  with  Claimants  Lawyers 

1.  Purpose 

The  claimants'  lawyers'  study  was  the  first  study  completed 
under  a grant  from  the  Walter  E,  Meyer  Research  Institute  of 
Law.  As  specified  in  the  research  proposal,  the  additional  grant 
was  obtained  to  pursue  three  principal  objectives — first,  to  obtain 
data  on  heretofore  unstudied  questions,  such  as  the  legal  prob- 
lems that  hinder  prompt  settlement  of  claims;  second,  to  verify 
financial  information  already  obtained  from  the  initial  personal 
interviews;  and  third,  to  provide  information  on  methodology 
which  would  enable  others  to  make  future  surveys  of  auto  accident 
compensation  with  increased  efficiency  at  a lower  cost.  The  claim- 
ants’ lawyers’  study,  along  with  the  studies  that  will  be  desetibed 
in  the  immediately  following  sections  of  this  chapter,  was  de- 
signed to  be  combined  with  data  obtained  in  the  first  part  of  the 
study  to  achieve  these  objectives. 

2.  Questionnaire  Content  and  Eligible  Respondents 

Claimants’  lawyers  were  questioned  about  legal  actions  taken 

On  behalf  of  their  clients  as  well  as  about  the  costs  and  compensa- 
tion to  both  themselves  and  their  clients.  Following  a detailed 
discussion  of  the  specific  case  sampled,  the)-  were  asked  a number 
of  general  questions  concerning  their  views  about  the  way  auto- 
injuty  cases  presently  ate  handled  in  Michigan  and  about  the 
problem  of  court  delay  in  Michigan, 

The  claimants’  lawyers’  sample  was  designed  so  that  it  could  be 
treated  as  an  independent  study  or  combined  with  other  patts  of 
the  total  study  for  case  analysis.  To  be  mote  specific,  claimants* 
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lawyers  were  selected  from  two  sources.  First,  all  the  plaintiffs' 
lawyers  listed  on  the  sampled  court  calendars  were  included  in 
the  sample,  regardless  of  whether  or  not  an  interview  had  been 
completed  with  their  client.  Second,  lawyers  hired  by  personal- 
interview  respondents  from  the  police  sample  were  included  only 
if  their  clients  had  given  permission,  cither  verbally  or  in  writing, 
for  the  interview  to  take  place.  The  composition  of  the  claimants’ 
lawyers'  sample  is  shown  in  the  first  column  of  Table  9-9- 

A further  word  should  be  said  about  the  request  for  permission 
to  interview  lawyers.  Each  of  the  37S  personal-interview  respond- 
ents who  hired  a lawyer  was  asked  to  sign  a waiver  of  confidential 
privilege  giving  permission  for  his  lawyer  to  disclose  costs  and 
legal  issues  in  the  case*  If  the  personal-interview  respondent  was 
reluctant  to  sign  the  waiver,  he  was  asked  to  give  verbal  permis- 
sion for  his  lawyer  to  be  interviewed.  And  if  either  verbal  or  writ- 
ten permission  was  given,  he  was  asked  for  the  name  and  address 
of  his  lawyer.  Scvcnty-onc  percent  of  those  asked  to  sign  the 
waiver  did  so,  and  87  percent  of  the  lawyers  listed  on  the  waiver 
agreed  to  be  interviewed.  This  compares  with  an  overall  response  . 
rate  of  77  percent  for  all  lawyers.  (The  precise  sequence  of  ques- 
tions is  shown  in  Appendix  E,  questions  F-14  to  F-16.)  A detailed 
evaluation  of  the  effectiveness  of  waivers  in  securing  interviews 
with  lawyers  is  presented  later  in  this  chapter. 

Returning  to  Table  9-9,  consider  again  the  two  groups  of  el- 
igible respondents.  The  207  plaintiffs'  lawyers  who  were  listed 
on  the  sampled  coutt  calendars  comprise  a probability  sample 
lepresentative  of  all  lawyers  filing  on  behalf  of  plaintiffs  in  Mich- 
igan personal-injury  automobile  accident  suits  during  one  calendar 
year.  For  purposes  of  estimating  state  aggregates,  each  of  these 
cases  need  only  be  weighted  in  accordance  with  both  its  initial 

•■fix  t»J<  of  * wiiff  * »i  ii’££«rcd  r«  of  iSf  tdritetr  commute  tt  < 

of  the commute*  in  April,  1£60.  The  comromee  felt  ihst  a refm* 

from  a lawyer*!  tlicot  wojW  tubuintiilly  irxftiK  the  o^ettU  re<poo?£  rare  for 
tXU  part  of  ihe  irv^r.  The  waiter  **»  pmrirei  in  i^tivkai  with  inpz/oJ 
IfxirrWjah  or  fntnUri  of  ihtii  fimiKcs  arxj  i?s  u«  aura  I to  c£er  do  significant 
I tut  rmwioj  problem** 
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chance  for  inclusion  in  the  sample  (i.e.,  the  sampling  interval  used 
to  select  the  court  calendars)  and  the  personal-interview  response 
rates.  The  response  rates  and  determination  of  interview  weights 
for  this  group  are  shown  in  Table  9-10. 

The  second  group  of  eligible  respondents  shown  in  Table  9*9, 
the  63  lawyers  hired  by  personal-interview  respondents  from  the 
police  sample,  docs  not  comprise  a representative  sample  of  any 
larger  group.  Therefore,  the  38  completed  questionnaires  from 
this  group  have  not  been  weighted  nor  have  they  been  included 
in  any  of  the  quantitative  material  presented  in  earlier  chapters; 
they  have  been  used  only  for  individual  case  studies.  The  quanti* 
tative  material  in  this  report  is  based  entirely  on  the  1 49  ques- 
tionnaires completed  with  the  plaintiffs'  lawyers  listed  on  the 
sampled  court  calendars. 

3-  Results 

The  response  rates  tor  the  two  claimants’  lawyers’  groups  arc 
shown  in  Table  9-9-  The  questionnaire  completion  rate  is  notice- 
ably higher  for  the  lawyers  hired  by  personal-interview  respond- 
ents from  the  police  sample  than  for  plaintiffs’  lawyers  listed  oh 
the  sampled  court  calendars.  A comparison  of  the  two  nonresponse 
groups  explains  the  difference.  Table  9-11  shows  why  interviewers 
were  unable  to  complete  interviews  with  58  of  the  plaintiffs’  law- 
yers listed  on  the  court  calendars.  Notice  that  twenty-six  of  the 
nonresponse  interviews  (44.$  percent)  were  never  attempted. 
Nineteen  of  these  "automatic  nonresponse”  were  lawyers  whose 
clients  had  cither  refused  to  be  interviewed  or  had  asked  that  no 
interview  be  attempted  with  their  lawyers;  although  it  was  felt 
that  interviews  should  not  be  attempted  with  the  lawyers  in  these 
cases,  the  cases  were  included  in  the  statewide  plaintiffs'  lawyers’ 
sample  by'  definition. 

Ten  additional  plaintiffs’  lawyers  declined  to  complete  the  ques- 
tionnaire because  they  could  not  remember  details  of  the  case 
and  the  case  file  could  not  be  located.  For  all  10  of  these  cases, 
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the  accident  inquired  about  had  occurred  at  least  4 years  prior  to 
the  interview.  The  two  groups  just  discussed  account  for  62.1 
percent  of  the  nonresponse.  The  remaining  37.9  percent  of  the 
nonresponses  are  classified  by  reasons  common  to  all  field  surveys 
— they  were  cither  refusals  or  cases  where  the  designated  respond* 
ent  w'as  absent  from  the  state  or  country  during  the  interviewing 
period. 

For  comparison,  a number  of  explanations  might  be  suggested 
for  the  extraordinarily  high  proportion  of  questionnaires  that 
were  completed  with  lawyers  hired  by  personal-interview  respond- 
ents from  the  police  sample.  First,  compared  with  the  court 
sample,  the  rcccntncss  of  the  accidents  considerably  reduced  the 
chance  of  the  file  not  being  available  ot  of  the  lawyer  not  being 
able  to  remember  the  case.  Second,  in  all  the  cases  included  in 
this  group,  the  lawyer’s  client  had  previously  been  interviewed 
and  the  client  had  given  his  verbal  or  written  consent  for  his 
lawyer  to  be  interviewed.  It  will  be  shown  shortly  that  -securing 
permission  from  the  client  did  increase  the  tesponse  rate  with 
lawyers.  A summary  of  the  reasons  given  for  the  five  nonresponses 
in  this  latter  group  is  shown  in  Table  9*12. 


TABLE  9-12 

Rcjion  Why  Qt/tifioiu:tire  Plot  Completed 
uilb  Chimeni'i  Leuytr,  Polite  Staple 


Rc4ioit  uhy  quettionnthe  u*t  not 
completed 

Number  of  qr/csthrituiret 
w ot  completed 

Interview  never  attempted 

(automatic  nonresponse) 

2 

Interview  refused  by  respondent 

1 

Interview  prevented  by  unavailability 

of  respondent's  case  file 

1 

Other  (respondent  is  not  ?n  attorney, 

although  he  did  gise  claimant  legal  advice) 

1 

Total  quest  ionn  si  res  not  Completed 

5 
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4.  A Note  Concerning  the  Effectiveness  of  Waivers 

Tables  9-13  and  9-1 4 indicate  that  the  securing  of  a client’s 
permission  to  interview  his  lawyer  does  substantially  increase  the 
probability  of  completing  an  interview  with  his  lawyer.  In  Table 
9-13  the  bottom  row  (which  shows  the  response  rate  for  inter- 
views that  were  assigned  to  the  field,  ie.,  the  results  for  all  cases 
where  an  interview  was  actually  attempted)  indicates  that  about 
90  percent  of  the  interviews  were  completed  when  the  inter- 
viewer had  a signed  waiver  available  to  show  the  lawyer,  that 
about  87  percent  of  the  interviews  were  completed  when  the 
interviewer  could  state  that  the  client  had  given  his  verbal  per- 
mission for  the  interview  to  take  place,  but  that  only  about  72 
percent  of  the  assigned  interviews  were  completed  when  the 
client’s  permission  had  not  been  secured.1 

Looking  at  both  Tables  9-13  and  9-14,  it  appears  that  the  fact 
of  having  permission  is  more  important  than  the  form  in  which 
the  permission  is  given.  There  is  essentially  no  difference  between 
the  response  rates  for  lawyers  who  were  presented  with  a signed 
statement  and  lawyers  who  were  simply  told  that  their  client  had 
granted  permission  for  the  interview.  Moreover,  Survey  Research 
Center  interviewers  were  instructed  to  leave  the  waiver  with  the 
lawyer  if  he  so  requested;  most  lawyers  gave  the  statement  merely 
superficial  attention  and  returned  it  to  the  interviewer.  From  the 
evidence  presented  here,  one  would  conclude  that  securing  a 
claimant’s  permission  to  interview  his  lawyer  did  contribute  to 
the  overall  response  rate;  but  the  evidence  does  not  suggest  that 
the  use  of  a waiver  produced  significantly  better  results  than  did 
the  securing  of  verbal  permission  only. 


he  » rcffcJ,  however,  ih.t  8 of  the  )l  rx>mc<poodeeit  in  ,)!, 
third  group  (Soc  Ttb.*  9-11.)  ro!d  the  inun.cv-cr  th»t  ih<f  «vuld  complete  t 
<jv»!iw>r.j.re  if  the  signed  rcleise  could  be  obcifncd.  ft  is  cntiielf  possible  tb« 
8 snorne.-i  might  have  completed  if*  qucstiotiniite  if  th*  issue  of  %tiun 
f*d  •**  h«™  tir|*!i»i*cd  in  both  the  imrodunotj  lertei  to  the  l»«Vef  trJ  in  the 
ntfodBOK>n  io  the  re+K^l  interview.  If  this  hij  been  the  t.se,  the  response  met 
in  ihethtn  ftoop*  would  hive  ken  tlmott  On  the  othet  hind,  it  it  eou»II» 
fwjihle  thst  lie!  of  permission  in  one  fotm  ot  ibe  c:he»  could  hive  sebstiMisUr 
(educed  the  rtspo-.s*  ti:<  in  the  first  t«o  groups. 
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J.  Telephone  Interviews  with  Dependants 
1.  Introduction 

A second  empirical  study  completed  under  the  Meyer  Grant  in* 
volvcd  the  completion  of  telephone  questionnaires  with  defendants 
listed  on  the  sampled  court  calendars.  In  developing  the  research 
design  for  this  part  of  the  projeer,  a number  of  problems  were 
encountered,  the  mo.st  important  of  which  are  discussed  briefly 
here. 

Corporations  named  as  defendants  were  excluded  from  the 
sample  for  a number  of  reasons.  First,  the  interviewing  schedule 
designed  to  be  completed  with  individual  defendants  was  not  ap- 
propriate for  interviewing  corporation  officials,  and  there  were  not 
enough  corporations  in  the  sample  to  justify  a separate  analysis. 
A less  important  reason  involved  the  practical  difficulty  of  locating 
the  individual  within  a corporations  claims  office  who  handled 
a particular  case.  More  important,  if  the  individual  could  be  lo- 
cated, the  information  asked  of  him  would  have  been  essentially 
the  same  as  that  req  .ested  on  the  defendants'  lawyers'  question- 
naire for  the  same  case.  In  fact,  in  many  cases  the  corporation's 
claims  representative  and  the  defendant's  lawyer  would  have  been 
the  same  individual.  Governmental  units  named  as  defendants 
were  also  excluded  from  the  sample  for  similar  reasons. 

Multiple  defendants  offered  an  additional  design  problem.  The 
unit  of  analysis  had  been  defined  as  all  defendants  listed  on  the 
court  calendars.  For  cases  where  two  or  more  related  individuals 
Jiving  together  were  named  as  joint  defendants,  such  as  a married 
couple,  the  interviewer  was  instructed  to  complete  first  a regular 
questionnaire  with  either  of  the  individuals  named  as  a defendant. 
A shorter  supplemental  questionnaire  was  then  completed  for 
each  additional  family  member  named  as  a defendant.  The  sup- 
plemental form  repeated  only  those  questions  from  the  tegular 
questionnaire  for  which  the  answers  would  be  expected  to  vary 
between  members  of  the  same  family,  such  as  whether  the  person 
was  driving  at  the  time  of  the  accident  and  the  basic  demographic 
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data.  Interviewers  were  permitted  to  complete  the  supplemental 
questionnaires  with  either  the  original  respondent  or  with  the 
family'  member  being  asked  about.  A review  of  the  interviewing 
results  shows  that  all  the  supplemental  questionnaires  were  com- 
pleted with  the  original  respondents. 

If  a defendant  had  died  between  the  time  the  suit  was  filed  and 
the  interviewing  period  (regardless  of  the  cause  of  death),  the 
interview  was  assigned  as  an  automatic  nonresponse;  that  is,  no 
alternative  respondent  was  allowed.  In  suits  where  the  defendant 
was  the  administrator  of  an  estate,  the  interviewers  were  instructed 
to  alter  t ie  wording  of  selected  questions  on  the  regular  ques- 
tionnaire by  hand,  so  that  the  cuestions  asked  administra'ors 
would  be  consistent  and  appropriate. 

Use  of  the  most  economical  da'ta-collection  technique  was  also 
given  careful  consideration.  A pretest  in  Detroit  indicated  that  the 
schedule  could  be  completed  by  telephone  with  little  difficulty,  and 
it  was  decided  to  attempt  the  initial  defendants’  interviews  by 
telephone,  with  the  option  of  switching  to  personal  interviews  if 
any  serious  difficulties  were  encountered  in  the  telephone  pro- 
cedure. Early  interviews  were  assigned  with  out-of-state  respond- 
ents who,  in  most  cases,  would  have  been  assigned  as  automatic 
nonresponses  using  personal-interview  techniques.  These  inter- 
views were  successful,  and  a second  group  of  interviews  were 
assigned  with  respondents  in  outlying  areas  of  Michigan  (these 
respondents  would  have  represented  high-cost  personal  inter- 
views), and  finally  the  entire  sample  was  assigned  for  telephon- 
ing. 

2.  Questionnaire  Content  and  Eligible  Respondents 

The  defendants'  questionnaire  was  designed  to  examine  the 
process  by  which  respondents  defended  themselves.  Every  effort 
was  made  to  separate  aspects  of  the  defense  process  from  other 
aspects  of  the  accident,  such  as  medical  expenses,  property  dam- 
age, etc.,  incurred  by  the  defendant  or  his  family.  An  effort  also 
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was  made  to  focus  the  respondent’s  attention  on  issues  relevant  to 
the  particular  suit  sampled,  as  opposed  to  other  suits  that  might 
have  been  brought  against  the  defendant  or  his  family  as  a result 
of  the  same  accident.  The  questionnaire  itself  asked  about  the 
defendant’s  involvement  in  legal  proceedings  (including  the  man- 
ner in  which  he  secured  counsel),  and  about  any  psychological 
and  economic  penalties  incurred  by  himself  or  his  family  as  a 
result  of  the  suit. 


3.  Results 

The  response  rates  and  the  determination  of  interview  weights 
for  completed  defendants’  questionnaires  are  shov/n  in  Table  9- 
15.  Note  that  even  with  the  exclusion  of  private  corporations  and 
governmental  units,  there  were  267  individual  defendants  listed 
on  the  207  sampled  court  calendars.  The  same  207  calendars 
listed  207  plaintiffs  (Table  9-7);  thus,  the  court  calendars  listed 
about  1.29  defendants  for  each  plaintiff. 

The  mobility’  of  American  families,  particularly  younger  fam- 
ilies, increases  the  problem  of  tracing  respondents  when  the  re- 
spondents (and  their  addresses)  are  selected  from  a list  that  is 
not  current.  Table  9-16  indicates  clearly  that  the  effectiveness  of 
the  procedures  used  to  locate  defendants  was  important  in  estab- 
lishing the  over-all  response  tate  for  this  pact  of  the  study.  Of  the 
267  defendants  designated  for  personal  interviews,  55  (65.5  per- 
cent of  the  nonresponse  group)  could  not  be  located;  no  inter- 
views could  be  attempted  with  this  group.8  Table  9-16  also  shows 


8 The  reader  insy  be  interested  in  a brief  description  of  the  procedures  used  to 
locate  sampled  defendants.  An  initial  effort  was  made  to  find  some  known  address 
for  each  defendant,  regardless  of  how  old  it  was.  Court  records  provided  complete 
1957  addresses  for  about  one-half  the  sample;  however,  for  most  of  the  remaining 
defendants,  the  1957  city  of  residence  was  mentioned  somewhere  in  the  court 
records.  Current  ( 1 96 1 ) telephone  books  were  checked  next,  arJ  if  the  1957 
address  found  in  the  court  records  and  the  address  shown  in  a current  telephone 
book  wcie  the  same,  the  questionnaire  was  tssigned  for  interviewing.  For  cases 
where  the  court  records  indicated  only  a city  of  residence,  the  current  telephone 
book  for  that  city  was  checked,  and  if  an  individual  with  the  same  name  as  the 
defendant  was  listed  in  the  telephone  book,  and  there  w&s  some  reason  to  be 


// 
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that  only  8 of  the  defendants  refused  to  be  interviewed  by  tele- 
phone. 

K.  Mail  Interviews  with  Defendants'  Lawyers 
1.  Introduction 

A final  empirical  study  required  interviewing  all  defendants' 
lawyers  listed  on  the  sampled  court  calendars,  using  a two-page 
mail  questionnaire.  This  study  was  undertaken  so  that  the  views 
of  all  major  participants  in  the  litigation  process  would  be  in- 
cluded in  die  final  report,  thus  permitting  comparisons  on  a case- 
study  basis.  Although  the  personal-interview  questionnaire  used 
with  plaintiffs’  lawyers  is  much  more  detailed  than  the  mail  ques- 
tionnaire used  Y/ith  defendants’  lawyers,  the  nine  questions  in- 
cluded on  the  defendants’  lawyers’  questionnaire  were  also  asked 
of  the  plaintiffs’  lawyers.  The  questions  are  primarily  concerned 
with  the  major  issues  or  sources  of  disagreement  in  the  care  and 
the  important  factors  underlying  determination  of  a final  settle- 
ment. 

The  principal  factor  which  limited  the  length  of  the  defend- 
ants’ lawyers’  questionnaire  and  dictated  the  use  of  a mail  question- 
naire was  the  concentration  of  a large  percentage  of  the  defense 
work  in  a relatively  few  firms.  In  most  instances,  these  firms 
are  affiliated  with  one  of  the  insurance  companies  writing  large 
amounts  of  automobile  insurance  in  Michigan.  Table  9-17  com- 
pares the  concentration  of  defense  lawyers  with  the  concentration 
of  plaintiffs’  lawyers  for  the  207  sampled  court  cases.  It  is  quickly 
apparent  that  a relatively  few’  law  firms  handle  a large  propor- 

confident  that  the  telephone  listing  W3S  that  of  the  defendant  (e.g.,  if  there  was 
only  one  individual  with  the  defendant’s  name  listed  in  the  telephone  hook,  or  if  ft 
telephone  Look  listing  showed  the  same  last  name  and  the  same  address  as  shown 
oo  the  court  records,  but  perhaps  a different  fust  came),  the  questionnaire  was 
assigned  for  interviewing. 

For  defendants  still  not  located,  additional  efforts  were  made  to  find  the  police 
report  for  the  accident  (which  might  list  either  the  the  address  of  the  defendant  or 
the  name  and  address  of  an  individual  who  would  know  the  present  whereabouts 
of  the  defendant).  Further  investigation  was  carried  out  at  the  R.  L.  Folk  Library 
in  Detroit  acd  at  the  Michigan  Drivers*  License  Bureau  in  Lansing.  Some  success 
was  achieved  through  each  of  the  soutccs  mentioned. 
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TABLE  9-1 6 


Reason  Why  Questionnaire  Was  Not  Completed 
with  Defendant,  Court  Sample 


Reason  why  question - 

Number  of  question - 

Percent  of  question * 

naire  was  not  completed 

naires  not  completed 

naires  not  completed 

Respondent  refused  to 
be  interviev/ed  (does 
not  want  to  talk  about 
it;  too  upset;  too  busy; 
can't  remember  details) 

9.5% 

8 

Respondent  ill  or  de- 

ceased (no  one- else 
available  to  complete 
questionnaire) 

16 

19.0 

Respondent  absent  (out 

of  state  or  country  dur- 

ing interviewing  period; 
no  one  else  available  to 

6.0 

complete  questionnaire) 
Respondent  could  net  be 

5 

located  (unable  to  deter- 
mine current  address  or 

• 

65.5 

telephone  number) 
Total  questionnaires  not 

55 

84 

100.0% 

completed 

tion  of  all  defense  work  for  Michigan  automobile  suits.  To  be 
more  precise,  the  survey  indicates  that  five  law  firms  handle  38 
percent  of  all  the  defense  work  resulting  from  Michigan  personal- 
injury  automobile  suits.  (One  law  firm  accounted  for  3 6 percent 
of  the  total  nonresponse  for  this  part  of  the  study.)  With  this 
heavy  concentration  of  defense  work,  it  seemed  unreasonable  to 
expect  that  the  lawyers  involved  would  be  willing,  or  should  be 
asked,  to  complete  more  than  a few  questions  about  any  one  case. 
These  considerations  led  inevitably  to  the  conclusion  that  the  final 
questionnaire  should  be  short  and  that  the  respondent  should  be 
allowed  to  complete  the  schedules  as  his  time  permitted.  Of 
course,  if  it  had  not  been  essential  to  interview  about  the  specific 
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TABLE  9-17 

Concentration  of  Cases  for  Sampled  Plaintiffs* 
and  Defendants ' Lawyers,  Court  Sample 


Number  of  sampled  Cases 
bandied  by  lawyer 
or  law  fun 

Whether  lawyer  or  law  frtn 
represented  plaintiff  or  defendant 
Plaintiffs  Defendants 

1 

87% 

72% 

2-3 

11 

15 

4-6 

2 

7 

7—9 

0 

1 

10-12 

0 

2 

13—15 

0 

1 

16 — 24 

0 

1 

25  or  more 

0 

1 

Total 

100% 

100% 

Total  number  of  lawyers  or 
law  firms  in  sample 

215* 

102 

Range  of  number  of  cases 
handled  by  single  lawyer 
or  law  firm 

1 — 5 cases 

1 — 28  cases 

* A small  number  of  these  lasers  were  not  listed  on  the  207  sampled  court 
calendats*  but  wetc  interviewed  when  it  viis  subsequently  determined  that  they 
played  a major  role  in  settling  the  case. 

cases  in  the  court  sample,  the  sample  would  have  been  designed 
to  avoid  the  extreme  clustering  of  interviews  evident  here.  But 
an  efficient  representative  sample  would  still  be  concentrated  to 
some  extent  in  certain  law'  firms  because  the  cases  are  concentrated 
in  these  firms. 

2.  Results 

The  response  rates  and  determination  of  interview  weights  for 
interviews  completed  by  defendants’  lawyers  are  shown  in  Table 
9-18. 

L.  Estimation  of  Economic  Losses 

' 1.  Introduction 

For  the  purposes  of  this  study,  comparing  losses  with  compensa- 
tion, it  is  the  loss  to  the  individual  or  his  family  which  is  relevant; 
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A SAMPLE  WITHIN  A SAMPLE 


Previous  discussions  of  survey  techniques  of  gathering  data 
have  involved  examples  where  the  universe  under  observation  has 
been  conveniently  contained  in  some  sort  of  official  record  or  list. 
Thus  Zeisel  referred  to  Census  data  which  is  collected  by  interview- 
ing "every  fifth  household, " ard  Conard's  study  elicited  information 
from  a universe  defined  by  "police  accident  reports.  " 

In  the  following  reading,  however,  research  employing  the 
survey  technique  is  discussed  in  relation  to  a universe  which  cannot 
be  defined  by  readily  accessible  lists  of  people,  things,  or  events. 
The  Gallup  Poll  is  concerned  with  predicting  election  results.  The 
universe,  consequently,  is  defined  as  the  voting  public  of  the  United 
States.  And  this  means  that  the  Poll  is  concerned  not  just  with  the 
adult  population  in  a particular  community  or  state  but  with  those 
who  in  fact  will  vote  in  the  country  as  a whole.  And,  of  course, 
there  is  no  list  containing  the  names  of  all  such  people  in  the  United 
States. 


The  solution  to  this  problem  is  to  engage  in  a multi-stage 
sampling  process.  The  universe  is  divided  into  small  manageable 


units  which  may  be  sub-divided  in  turn.  As  Paul  Perry  indicates  t 
"A  systematic  sample  of  cities  and  minor  civil  divisions  is  drawn 
....  For  election  survey  purposes  a selection  of  smaller  units 
. , . is  drawn  ....  In  cities,  such  smaller  units  are  usually  wards. 
Within  these  units  precincts  are  selected.  " 

It  must  be  remembered  that  the  success  of  this  approach, 
is  dependent  upon  preserving  the  basic  idea  of  random  sampling, 
namely,  that  every  unit  finally  chosen  has  an  equal  probability  of 
being  selected.  It  is  only  when  this  assumption  is  valid  that  we 
are  able  to  use  much  of  the  statistical  theory  basic  to  statistical 
inference — an  issue  we  will  explore  in  the  next  chapter. 


From  Paul  Perry,  "Election  Survey  Procedures  of  the 
Galluo  Poll, " The  Public  Opinion  Quarterly , 
24  (Fall,  1960),  pp,  531-542.  Reprinted  by  permission. 
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Paul  Perry,  "Election  Survey  Procedures  of  the  Gallup 
Poll,"  The  Public  Opinion  Quarterly,  Vol . 24.  Fall. 
1960,  pp.  531-542. 


The  development  of  sample  surveys  of  the  population  has  been 
accompanied  by  their  highly  publicized  use  as  a means  of  determining 
the  voting  public's  political  preferences.  The  use  of  sample  surveys 
for  this  purpose  has  provided  a difficult  test  of  the  methol,  particu- 
larly in  the  United  States,  where  a large  part  of  the  adult  population 
fails  to  vote.  Testimony  to  this  effect  appeared  in  Chapter  I of  The 
Pre-election  Polls  of  1948,  a Social  Science  Research  Council  publica- 
tion issued  in  1949.  The  statement  was:  “The  committee  and  the  staff 
would  like  to  point  out  that  pre-election  polling  and  predictions 
constitute  one  of  the  more  severe  tests  of  polling  or  survey  method- 
ology. There  are  more  steps  in  pre-election  polling  at  which  error 
can  enter  than  there  are  in  many  other  applications  of  survey  tech- 
nique.” 

Since  a straightforward  sample  survey  of  the  total  adult  population 
on  opinion  with  respect  to  parties  or  candidates  quantifies  only  one 
variable  in  the  formula,  this  has  provided  the  Gallup  Poll  with  some 
difficult  problems.  What  is  specifically  required,  of  course,  is  an  esti- 
mation procedure  that  takes  into  account  the  major  variables  and 
conditions.  Progress  in  the  development  of  such  an  estimation  pro- 
cedure by  the  Gallup  Poll  has  increased  the  reliability  of  its  pre- 
election estimates  of  the  division  of  the  vote  based  on  survey  data. 
For  this  reason  it  probably  has  some  intrinsic  interest  for  many  who 
are  engaged  in  survey  research,  and  the  methods  used  may  prove 
useful  in  other  political  research. 

The  methods  of  sampling  the  voting  population  used,  relating 
voting  participation  to  candidate  preferences,  dealing  with  the  prob- 
lem of  the  undecided,  obtaining  unbiased  expressions  of  candidate 
preference,  and  measuring  reactions  to  major  events  close  to  election 
day  are  of  methodological  concern  to  all  engaged  in  political  research. 

In  this  discussion  data  reported  concerning  election  surveys  and 
samples  are  based  on  the  United  States  excluding  the  South.1  The 

• The  author  ij  president  of  the  Gallup  Organization,  Inc,  and  research  director 
of  the  Gallup  Poll. 

1 States  excluded:  Alabama,  Arkansas,  Florida,  Georgia,  Kentucky,  Louisiana, 
Mississippi,  North  Carolina,  Oklahoma,  South  Carolina,  Tennessee,  Texas,  and 
Virginia. 
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South  has  been  excluded  because  it  presents  special  difficulties  which 
introduce  additional  variables.  For  example,  in  Congressional  elections 
many  districts  in  the  South  are  uncontested.  Another  difficulty  is 
presented  by  the  fact  that  the  South’s  share  of  the  total  vote  fluctuates 
between  elections  to  a much  greater  extent  than  is  true  for  other 
major  regions.  The  limited  scope  of  the  article  precludes  going  into 
the  details  of  the  methods  of  coping  with  these  factors. 

The  data  reported  are  based  on  Gallup  Foil  experience  in  the  elec- 
tions of  1950,  1952,  1951,  1956,  and  1958.  In  1950  certain  of  the  basic 
elements  in  the  estimation  procedure  here  described  were  first  adopted 
and  used.  We  believe  it  is  noteworthy  that  as  a result  of  using  these 
methods,  our  pre-election  estimates  of  how  the  popular  sole  divided 
in  four  of  these  elections  (1950,  1954,  1956,  and  1958)  were  the  four 
most  accurate  estimates  of  'he  twelve  which  the  Gallup  Poll  has  made 
in  national  clectiot  s.  The  first  of  the  twelve  estimates  was  made  in  the 
1 930  presidential  election. 


THE  SAMPLE 

The  sample  of  areas  used  for  the  Gallup  Poll  election  surveys  is 
the  same  as  the  sample  of  areas  used  for  the  Gallup  Poll  surveys  be- 
tween election  periods.  These  areas  are  drawn  in  accord  with  the 
principles  of  the  theory  of  probability  sampling.  Between  election 
period?  the  areas  consist  of  block  clusters  and  rural  segments.  In  the 
two  election  surveys  which  provide  the  basis  for  the  final  pre-election 
estimate,  the  areas  consist  of  election  precincts,  drawn  in  the  same 
manner  as  the  block  dustets  and  rural  segments.  The  change  to  elec- 
tion precincts  in  the  period  immediately  preceding  the  election  is 
made  became  of  the  greater  control  provided  with  regard  to  the 
political  representativeness  of  the  sample  of  areas.  For  surveys  on 
most  general  issues  the  precinct  as  the  basic  areal  sampling  unit 
would  have  little  or  no  advantage  over  block  clusters  and  segments 
and  might  have  some  disadvantages. 

The  sample  of  areas  is  drawn  in  the  following  manner:  A systematic 
sample  of  cities  and  minor  civil  divisions  is  drawn  from  regional- 
city  sire  strata  with  probability  of  selection  proportional  to  sire. 
Within  places  so  drawn,  for  election  survey  purposes  a selection  of 
smaller  units  for  which  election  data  are  available  is  drawn  in  the 
same  manner.  In  cities  such  smaller  units  are  usually  wards.  Within 
these  units  precincts  are  selected.  The  selection  of  the  precincts  pro 
cccds  in  this  manner:  Election  results  for  the  previous  national  elec- 
tion are  obtained  for  each  precinct  in  thr  ward.  One  precinct  is 
then  drawn  with  probability  of  selection  proportional  to  the  pre- 
cinct’s total  vote.  Within  the  precinct  a sjstcmatic  sample  of  house- 
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holds  is  selected,  and  one  adult  from  each  household  is  interviewed. 
The  total  number  of  adults  in  each  household  is  obtained  in  the 
survey  to  provide  for  a size-of-household  correction  when  necessary. 

For  each  precinct  so  selected,  the  proportions  in  which  the  vote 
divided  by  parties  in  the  previous  one  or  two  national  elections  is 
computed.  In  2958,  for  example,  the  percentage  voting  for  Eisen- 
hower of  the  Eisenhower-Stevenson  vote  in  1956  was  computed  for 
each  precinct  drawn  into  the  sample.  Since  the  precincts  were  drawn 
with  probability  proportional  to  size,  the  mean  Eisenhower  percent- 
age of  the  precincts  in  the  sample  served  as  a measure  of  the  repre- 
sentativeness of  the  areas  drawn.  For  example,  if  the  mean  Eisenhower 
percentage  for  such  a sample  was  58.5,  the  sample  of  precincts  was 
07  percentage  points  higher  than  the  vote  for  Eisenhower  in  195G, 
when,  of  the  vote  for  Eisenhower  and  Stevenson,  57.8  per  cent  voted 
for  Eisenhower.  When  such  a bias  is  found  to  exist,  a simple  correction 
of  the  final  estimate  can  be  made,  and  the  bias  in  the  sample  of 
precincts  with  respect  to  the  previous  election  removed. 

The  vole  in  a previous  election  is  perhaps  the  mort  efficient  control 
one  can  use  for  an  election  survey  (unless  ihe  span  of  time  between 
elections  is  very  great),  either  as  a basis  for  stratification  or  as  a part 
of  the  estimation  procedure.  However,  difficulties  are  involved  in 
using  it  as  a basis  for  stratification  in  a national  election,  at  least  if 
precincts  are  the  areal  units.  Precinct  data  are  not  available  from  a 
central  source;  a great  deal  of  time  and  effort  must  be  expended  to 
obtain  election  results  for  even  a modest  number  of  precincts.  The 
use  of  precinct  data  becomes  quite  laborious  even  with  a double 
sampling  procedure,  in  which  a large  sample  of  precincts  is  drawn 
first,  strata  are  formed,  and  the  final  sample  is  drawn  from  the  strata. 
Therefore,  it  appears  to  be  more  efficient  to  use  the  past  voting  data 
in  forming  the  estimates  afler  the  sample  is  taken  instead  of  using 
them  in  drawing  the  sample. 

The  correlation  of  the  vote  b)  precincts  between  elections  is  fairly 
high.  For  example,  wc  have  found  the  variance  between  precincts  in 
terms  of  shift  or  change  in  the  percentage  of  the  vote  for  a candidate 
of  a given  patty  between  two  electrons  is  only  about  two-thirds  as 
large,  typically,  a?  the  variance  between  them  in  terms  of  parly  or 
candidate  preference  in  a single  election. 

In  the  1950,  195s,  and  1956  samples  the  deviation  of  the  mean 
precinct  party  |vrecntage  for  the  previous  election  used  as  a control 
was  small;  in  1951  and  1958  it  was  of  some  consequence.  The  devia- 
tions of  the  means  for  each  previous  election  are  shown  in  Table  1. 

This  step  cannot  be  expected,  of  course,  to  improve  matters  in  every 
instance.  Some  of  the  time  it  will  make  things  worse.  In  view  of  the 
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TABLE  1 

Deviations  of  the  Me-  <s  of  the  Samples  of  Precincts  * 
in  Per  Cent  Republican  of  the  Democratic-Republican  Vote 


Survey  Year 

Past  Election 

Used 

Devtations\ 

Number  of 
PrecincU 

1950 

1946 

+0.7 

50 

1952 

1948 

+0.3 

100 

1954 

1952 

+2.8 

105 

1950 

1952 

—0.2 

171 

1958 

1950 

-2.2 

150 

• Outside  the  South. 

f For  example,  the  1940  election  results  In  precincts  in  the  samp’?  in  1950  had 
A mean  per  cent  Republican  vote  0.7  of  a percentage  point  higher  than  was  true 
In  the  United  States  outside  the  South.  As  a sample  of  precincU,  they  were  this 
much  in  error  in  terms  nf  the  remits  of  the  previous  election. 

correlation  between  election*  in  the  division  of  the  vote,  however, 
it  can  be  expected  to  improve  matters  at  least  the  majority  o f times. 

The  above  illustrates  one  of  the  advantages  of  using  precincts  as 
the  sampling  unit  as  compared  with  other  areal  units  such  as  census 
tracts  or  enumeration  districts.  While  it  is  net  possible  to  obtain  for 
precincts  census  statistics  such  as  those  concerning  demographic  char- 
acteristics of  the  population,  this  disadvantage  is  more  than  offset  in 
our  opinion  by  the  availability  of  election  data.  In  fact,  the  avail- 
ability of  census  data  by  small  areal  units  is  of  limited  value  in  any 
case,  because  it  is  useful  only  in  the  first  few  years  following  the 
decennial  census.1 

Within  the  precinct,  unless  it  has  an  exceedingly  large  population, 
the  sampling  interval  used  in  the  selection  of  households  requires 
the  interviewer  to  cover  the  area  entirely.  That  is  to  say,  the  sample 
does  not,  in  most  cases,  consist  of  clusters  of  households.  The  inter- 
viewer is  given  a randomly  selected  starting  point  and  takes  every 
nth  household  throughout  the  precinct.  In  larger  precincts,  a sample 
of  blocks  or  segments  is  selected  and,  within  the  blocks  or  segments 
drawn  into  the  sample,  a systematic  sample  of  households  is  drawn 
as  above.  These  are  precincts  where  the  sampling  interval  for  cover- 
age of  all  blocks  in  the  precinct  would  be  so  large  that  it  would 
create  serious  problems  of  maintaining  an  accurate  count. 

Within  the  household  the  individual  who  is  interviewed  is  selected 
from  among  those  who  arc  at  home,  in  the  following  manner:  The 
interviewer  asks  first  to  speak  to  the  youngest  man  of  voting  age  who 

• The  use  of  ptetincts  as  die  areal  sampling  unit  In  election  imveyi  aim  has  some 
practical  advantages  met  use  of  other  dWnoni  such  as  census  trat«s  or  enumeration 
districts.  For  example,  if  intrniewer  failure  causes  loss  of  a sampling  area  too  late 
fot  replacement,  Inosdodge  of  the  precious  toting  behasmr  of  the  area  cnaMm  one 
to  adjust  statistically  for  any  resultant  political  Mas. 
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is  at  home;  if  no  man  is  at  home  he  asks  to  speak  to  the  youngest 
woman  who  is  at  home.  If  no  one  is  at  home  or  the  indicated  person 
refuses,  the  interviewer  is  tohl  to  go  to  the  next  adjacent  dwelling 
unit.  In  such  a case  the  interviewer  resumes  the  interval  count  from 
the  original  dwelling  selected.  The  interviewer  is  given  a male-female 
assignment.  Thus  if,  following  the  procedure  described  above,  the 
number  of  men  assigned  is  obtained,  m subsequent  households  the 
interviewer  asks  to  speak  only  to  the  youngest  woman  at  home.  In- 
terviewing hours  are  from  4 p.m.  fo  10  p.m.  A record  of  refusals  is 
kept,  with  information  concerning  the  person’s  sex,  estimated  age, 
education,  race,  and  whether  the  resident  is  in  a high*,  medium-,  or 
low-rental  area. 

A point  commonly  made  about  the  use  of  precincts  as  sampling 
units  is  that,  since  the  boundaries  of  many  of  them  are  frequently 
changed,  their  usefulness  is  limited  as  a political  control'  This  objec- 
tion is  based  on  a mistaken  notion  of  how  they  are  user.  The  vote  in 
a previous  election  is  the  control,  and  therefore  the  boundaries  of  the 
precincts  as  they  were  at  the  time  of  that  election  are  the  ones  used. 
It  makes  no  difference  whatsoever  how  frequently  they  are  changed 
thereafter.  It  would  be  useful,  of  course,  in  analyzing  the  accuracy  of 
the  survey  results  if  the  boundaries  were  constant  from  election  to 
election.  We  have  found  that,  on  the  average,  about  75  per  cent  of 
precitifU  remain  the  same  over  a four-year  period,  and  we  use  these 
for  error  analysis. 


LATE  CAMPAIGN  SUltVEY 

A discussion  of  the  use  of  the  precinct  a < the  ultimate  areal  sampling 
unit  in  the  design  leads  into  a description  of  the  method  used  to  bring 
the  measurement  of  voting  i .lenlions  as  close  to  the  election  as  possible. 
The  use  of  precincts  on  a large  scale  by  the  Gallup  Poll  began  with 
the  1950  congressional  election.  Although  a method  of  carrying  out  a 
late  October  survey  and  getting  the  results  in  time  for  pre-election 
publication  had  been  wotked  out  and  used  in  1910  and  1944  (unfor- 
tunately not  in  1948),  a less  cumbersome  method  was  sought.  A com- 
bination of  the  telegraphic  communication  of  survey  results  to  the 
home  office  with  the  control  provided  by  the  use  of  precincts  seemed 
to  be  particularly  advantageous  and  was  the  method  adopted.  The 
two  final  election  surveys  are  usually  conducted  in  October.  The  first 
one  usually  takes  place  the  first  week  in  October;  the  second  cn  Wednes- 
day,  Thursday,  and  Friday  of  the  week  prior  to  the  election.  The  first 
survey  questionnaires  are  returned  by  mail.  However,  the  interviewers 
are  also  required  to  carry  out  all  the  steps  involved  in  the  final  tele- 
graphic survey,  including  tabulation  of  the  results,  recording  and  check* 
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ing  the  tabulations  on  forms  provided,  and  preparation  of  the  telegram. 
We  find  that  about  80  per  cent  of  the  interviewers  do  this  quite  pre- 
cisely and  accurately,  about  15  per  cent  make  small  errors  which  cannot 
seriously  affect  the  results,  and  the  remainder  make  serious  errors  which 
have  to  be  dealt  with.  In  the  latter  case,  the  questionnaires  arc  returned 
to  the  interviewers  and  they  are  requested  to  read  their  directions  once 
again,  do  over  the  tabulation,  and  record  the  corrected  data.  Telephone 
consultations  are  then  conducted  with  each  of  them  to  be  sure  their 
confusion  has  been  eliminated.  Those  who  make  minor  errors  are  also 
told  about  it,  and  once  again  great  stress  is  laid  on  the  necessity  of 
accuracy.  A simplified  breakdown  of  the  data  has  to  be  employed,  of 
course,  to  minimize  the  possibility  or  errors.  For  example,  interv  iewers 
are  required  only  to  divide  their  questionnaires  into  two  easily  classi- 
fied groups— one  group  consisting  of  those  who  say  they  plan  to  vote, 
the  second  group  comprising  all  others.  They  tabulate  the  voting 
preference  only  among  those  who  say  they  plan  to  vote.  Interviewers 
are  also  asked  to  tabulate  and  telegraph  the  educational  attainment 
of  all  respondents  in  six  categories,  but  not  the  vote  by  education 
categories.  The  latter  step  would  increase  the  chance  of  transmission 
errors  so  much  that  it  is  not  considered  worth  the  risk. 

As  a variable  closely  associated  with  socio-economic  status  and  in- 
come, the  educational-attainment  information  is  used  to  standardize 
tht  two  surveys  in  this  respect.  Since  the  vote  by  education  is  not 
obtained,  for  the  reason  indicated  above,  any  difference  between  the 
two  samples  by  education  is  dealt  with  in  this  manner:  The  first  survey 
is  tabulated  by  the  six  education  categories  (college  complete,  college 
Incomplete,  high  school  complete,  high  school  incomplete,  grammar 
school  seventh  and  eighth  grades,  gtammar  school  sixth  grade  or 
less),  and  the  vote  or  preference  in  each  category.  The  data  are  then 
adjusted  to  fit  the  education  distribution  of  the  second  sample,  and 
the  change  in  soling  preference  which  results  is  applied  to  the  results 
of  the  second  survey.  This  sample  control  of  the  second  survey  is 
designed  primarily  to  protect  against  the  consequences  of  interviewer 
failure  to  do  a careful  coverage  of  the  precinct  the  second  time  around. 
Since  dm  control  has  been  employed,  the  effect  of  the  adjustment 
has  been  in  no  case  more  than  one-  or  two-tenths  of  a percentage  point. 

TURNOUT 

Another  important  step  in  the  proceduie  for  estimating  election 
results  is  projecting  the  proportion  of  the  population  old  enough 
to  vote  who  will  vote,  or  what  might  be  called  the  tutnout  ratio. 
Our  estimate  of  thh  ratio  is  derives!  ftom  the  results  of  several  ques- 
tions which  are  related  to  voting  participation.  Some  combinations 
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used,  for  example,  are  based  on  these  two  questions:  “How  much 
thought  have  you  given  to  the  coming  November  elections — quite  a 
lot,  or  only  a little?1'  and,  “Do  you,  yourself,  plan  to  vote  in  the  elec- 
tion this  November,  or  not?"  If  the  answer  is  “yes,”  the  respondent 
is  asked,  “How  certain  are  you  that  you  will  vote — absolutely  certain, 
fairly  certain,  or  not  certain?"  For  previous  elections  a record  has 
been  kept  of  the  relationship  between  the  proportion  who  answer 
such  questions  in  a category  indicating  a likelihood  of  voting  and 
the  actual  turnout  ratio  in  the  election.  From  these  relationships  a 
sufficiently  accurate  estimate  of  the  turnout  ratio  can  be  made  for 
estimating  preference  in  that  part  of  the  sample  representative  of  :hose 
who  will  vote.  For  example,  the  method  projected  a turnout  percent- 
age of  61  in  1956  as  compared  with  the  actual  turnout  ratio  of  60; 
in  1958  it  projected  a turnout  of  45  as  compared  with  the  actual 
turnout  of  44.  In  1950,  195?,  and  1954,  the  turnout  ratio  was  projected 
from  available  registration  data. 

The  variable  of  voting  participation  is  an  important  one  in  United 
States  elections,  and  a technique  for  taking  it  into  account  is  required. 
Surveys  during  the  past  twenty-five  years  have  shown  that,  during 
this  period,  in  the  aggregate  those  persons  who  favor  the  Democratic 
Farty  or  candidate  in  national  elections  are  less  likely  to  vote  than 
those  who  favor  the  Republican  ^arty  or  candidate.  Even  if  this  were 
not  true,  however,  so  long  as  only  a part  of  the  electorate  votes,  steps 
have  to  be  taken  to  provide  a sample  of  that  particular  part  of  the 
population,  because  it  may  well  differ  from  the  otal  population  old 
enough  to  vote.  The  turnout  ratio  is  particularly  low  in  national 
nonpresidcntial  elections.  The  ratios  for  the  past  five  elections  are 
shown  in  Table  2. 


TABLE  2 

Pe*  Cent  or  CntiED  States  PorvuvnoN  Old  Enough 
to  Vote  Who  Voted  w 1950,  1952,  1954,  1956,  1958 


Election 

Typtof 

E fed  ion 

Pet  Cent 
Who  Voted 

1950 

Congressional 

41.9 

1952 

PresidentiJ 

62.5 

nst 

Congressional 

42.4 

.056 

Presidential 

60.4 

1958 

Congressional 

43.9 

To  make  allowance  for  this  actor  we  use  questions  such  as  these: 

How  much  thought  have  you  given  to  the  coming  November  elections— 
quite  a lot,  or  only  a little? 

Have  yo  1 ever  soted  in  this  precinct  or  district? 
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Where  do  people  who  live  in  this  neighborhood  go  lo  vote? 

Are  you  now  registered  so  that  you  can  vote  in  the  election  this  November? 

Do  you  plan  to  register  so  that  you  can  vote  in  the  November  election? 

How  often  would  you  say  you  vote — always,  nearly  always,  part  of  the  time, 
or  seldom? 

Do  you,  yourself,  plan  to  vote  in  the  election  this  November? 

How  certain  are  you  that  you  will  vote — absolutely  certain,  fairly  certain, 
or  not  certain? 

In  the  election  in  November,  1956— when  Eisenhower  ran  against  Steven- 
son—did  things  come  up  which  kept  you  from  voting,  or  did  you  happen 
lo  vote? 

Post-election  studies  indicate  that  the  answers  to  these  questions  arc 
related  to  voting  participation,  and  indicate  the  extent  to  which  they 
are  related.  A scoring  system  based  on  such  studies  is  applied  to  the 
answers  of  respondents  to  each  question.  The  scoring  system  is  such 
that  the  greater  their  likelihood  of  voting,  the  higher  their  score.  Re- 
spondents arc  then  ranked  on  the  basis  of  their  scores. 

Before  using  the  turnout  scale  scores,  all  persons  are  excluded  who 
say  they  are  not  registered  and  do  not  plan  to  register,  or  who  say 
they  do  not  plan  to  vote.  Thus  a person  who  is  not  registered  and 
does  not  plan  to  register  or  vdio  says  he  or  she  docs  not  plan  to  vote 
is  not  included  in  the  ' 'ample  of  persons  u|>on  which  the  election 
estimate  is  based.  These  people  are  excluded  because  our  studies 
and  the  studies  of  others  have  indicated  that  a negligible  percentage 
of  them  vote,  something  on  the  order  of  between  1 and  5 per  cent. 
Using  the  remaining  sample  of  respondents,  each  class — on  the  basis 
of  turnout  scale  scores — is  tabulated  on  party  or  candidate  preference 
(depending  upon  whether  it  Is  a national  presidential  or  congres 
sional  election);  the  division  of  preference  is  then  computed  for  the* 
first  scale  position  (those  most  likely  to  vote);  the  first  and  second 
scale  positions  combined;  the  first,  second,  and  third  scale  positions 
combined;  and  so  on.  With  this  classification  and  accumulation,  and 
the  turnout  ratio  previously  computed,  that  pottion  of  the  total 
sample  with  the  greatest  likelihood  of  voting  (on  the  basis  of  these 
criteria)  is  selected,  and  the  preference  computed  for  that  group. 
Usually  this  requires  intcrpo’ation  between  classes. 

THt  UNDF.CIDCD 

For  accurate  estimation  of  the  probable  division  of  the  vote,  it  is 
essential  to  minimize  the  proportion  ol  replies  which  are  in  the  cate- 
gory of  undecided.  For  analytical  purposes  it  is,  of  course,  quite  use- 
ful to  use  methods  for  maximizing  the  undecided,  and  this  can  be 
done  quite  easily.  The  opposite  objective  of  reducing  the  undecided 
can  be  quite  difficult,  however,  and,  of  course,  when  it  is  truly  a 
case  of  being  completely  undecided,  impossible.  Two  basic  approaches 
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we  have  adopted  are  (i)  to  use  a question  which  makes  the  decision 
as  easy  as  possible  and  {2)  to  ask  the  undecided  person  if  he  can 
say  in  which  direction  he  is  leaning  in  his  preference.  The  question 
combination  being  used  in  current  Gallup  Poll  surveys  is: 

a.  If  the  Presidential  election  were  being  held  TODAY*  which  candidates 
would  you  vote  for — Nixon  and  Lodge  or  Kennedy  and  Johnson? 

IF  UNDECIDED  OR  REFUSED  ASK: 
b.  As  of  today,  do  you  lean  more  to  Nixon  and  Lodge  or  more  to 
Kennedy  and  Johnson? 

Before  the  'V  question  is  asked,  the  respondent  is  handed  a card 
on  which  are  printed  the  names  of  the  two  major  parties  and  the 
respective  candidates  underneath  their  party  designation. 

The  question  has  been  framed  to  ask  for  the  respondent’s  choice  if 
the  election  were  to  be  held  "today”  because  it  provokes  less  indecision 
than  do  questions  which  ask  how  a respondent  expects  to  vote  in 
November.  Currently  the  use  of  the  ’ leaning”  question  reduces  the 
proportion  of  undecided  by  half.  Sometimes  the  effect  of  a further  re- 
duction of  the  undecided  can  be  obtained  by  the  use  of  a secret  ballot; 
this  will  be  touched  upon  later. 

COMPARING  SURVEV  RESULTS  AND  ELECTION  RESULTS 

Three  sets  of  party-  or  candidate-preference  survey  results  have  been 
computed  for  each  of  the  past  five  national  elections,  and  the  error 
computed  for  each.  The  three  sets  are  (1)  the  total  sample,  with  a 
shift  adjustment  based  on  the  precinct  ^ote  in  a previous  election; 
(t)  those  in  the  sample  who  are  registered  or  who  plan  to  register 
and  plan  to  vote,  with  shift  adjustment;  (5)  the  accumulated-classes 
method  described  above,  with  shift  adjustment.  The  resutts  are  shown 
in  Tables  3.  4,  and  5,  and  provide  a summation  based  on  the  final 
survey  dam  illustrating  how  each  step  in  the  estimation  procedure 
affected  the  results.  As  stated  earlier,  this  summation  is  based  on  the 
vole  excluding  the  South. 

It  should  be  emphasired  that  in  actual  practice  we  are  unable  to  make 
complete  use  of  the  daia  obtained  in  the  final  pre-election  survey. 
Insufficient  time  elapses  between  the  last  day  of  interviewing  and  the 
deadline  for  the  release  of  the  results  to  subscribing  newspapers  to 
receive  the  completed  questionnaires  through  the  mail,  process  them, 
and  do  a full  analysis,  in  practice,  in  the  case  of  the  final  survey,  we 
have  a simple  basis  for  sotting  out  likely  voters  for  the  interviewers  to 
use  in  their  tabulations.  The  procedures  described  are  used  in  process- 
ing the  survey  in  early  October;  to  the  division  of  the  vote  obtained  is 
applied  any  change  observed  between  this  survey  and  the  final  one. 
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TABLE  3 


Comparison  of  Election  Results  and 
Survey  Results  in  the  Total  Sample*: 

Per  Cent  Republican  of  the  Democratic-Republican  Vote 


Election 

<N) 

Surrey 

flmifii 

Election 

Results 

Deviation 

1950 

(2,452) 

48.0 

52.7 

-4.7 

1952 

(4,814) 

53.2 

56.8 

-3.6 

1954 

(4,784) 

45.6 

50.5 

-4.9 

1956 

(6,173) 

61.8 

59.0 

+ 2.8 

1958 

(4,625) 

42.6 

46.1 

-3.5 

Mean! 

3.9 

Range f 

2.8-49 

* Collected  for  bias  In  terms  of 

a previous  election. 

f Disregarding  signs. 

TABLE  4 

Comparison  of  Election  Results  and  Survey  Results 

among  Those  Registered, 

Plan  to  Register, 

a no  Plan  to  Vote*: 

Per  Cent  Republican  of  the  Democratic-Republican  Vote 

Surrey 

Election 

Election 

(N) 

Reitifri 

Results 

Dei  to!  ion 

1950 

(1,999) 

49  2 

52.7 

-3.5 

1952 

(4,229) 

54.2 

56.8 

-2.6 

1954 

(3,733) 

47.3 

50.5 

-3.2 

1956 

(5,438) 

61.6 

59.0 

+2.6 

1958 

(3,662) 

43.8 

46.1 

-2.3 

Meant 

2.8 

Rangel 

2.3-3  3 

* Corrected  for  bias  In  terms  of  a previous  election. 
I Disregarding  signs. 


TABLE  5 

Comparison  of  Election  Results  and  Survey  Results  among 
Those  Whose  Turnout  Score  Placed  Them  in  That  Part  or  the 
Sample  Corresponding  to  the  Expected  Turnout  Proportion: 
Per  Cent  Republican  of  the  Democratic-Republican  Vote 


Election 

(N) 

Surrey 

RervJli 

Election 

Results 

Drttofton 

1950 

(1,235) 

52.4 

52.7 

-0.3 

1952 

(3,240) 

55.3 

56.8 

-I J5 

1954 

(2,410) 

50.0 

50.5 

-0.5 

1956 

(4,150) 

62.1 

53,0 

+3.1 

1958 

(2,405) 

45.9 

46.1 

-02 

Mean* 

11 

Range* 

0141 

O 

ERIC 


• Disregarding  signs. 
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In  comparing  the  two  surveys,  of  course,  the  simplified  basis  used  by 
interviewers  in  separating  likely  voters  from  nonvoters  for  their  tele- 
graphic report  is  used  for  both.  In  the  description  here,  however,  the 
final  survey  materials  as  analyzed  after  the  election  have  been  used, 
because  an  accurate  appraisal  of  the  results  of  the  methods  can  best  be 
made  with  survey  data  obtained  as  close  as  possible  to  the  election.  The 
final  survey  ’nterviewing  period  in  the  five  elections  covered  in  this 
description  has  centered  five  days  before  election  day. 

USE  OF  A SECRET  BALLOT 

Obtaining  unbiased  expressions  of  candidate  preference  is,  of  course, 
a matter  of  continuing  great  concern  in  election  surveys.  The  tradi- 
tional and  obvious  method  cf  dealing  with  the  problem  is  to  use  a 
secret  ballot.  Doing  so  in  a completely  satisfactory  way  in  the  typical 
survey  situation  present1;  difficulties,  although  the  Callup  Poll  has 
used  a secret  ballot  at  one  time  or  another  in  election  surveys  from 
1936  onward.  Early  in  1956  a method  was  worked  out  to  incorporate  a 
secret  ballot  procedure,  making  use  of  an  approach  developed  by 
Sidney  Goldish  in  the  election  surveys  of  the  Minnesota  Poll.  In  this 
approach,  the  interview  is  conducted  in  the  usual  manner  (he.  as  an 
"open”  interview)  for  alt  questionnaire  items  except  the  voting  prefer- 
ence question.  This  enables  the  interviewer  to  establish  rapport  with 
the  respondent  in  the  manner  customary  in  an  "open”  interview.  After 
asking  a series  of  intioduclory  questions,  the  respondent  is  handed  a 
card  showing  the  parties  and  candidates  and  asked  to  check  his  choice 
privately,  fold  the  card,  and  drop  it  in  a ballot  box  carried  by  the 
interviewer.  The  remainder  of  the  interview  is  carried  out  in  the  usual 
nianw  * 

The  secret  ballot  was  employed  in  two  Gallup  Poll  national  suiveys 
conducted  in  early  October  and  mid-October  of  1956  and  in  mid- 
September  and  mid-October  of  1958.  In  these  surveys  an  interpene- 
trating sample  was  used,  with  every  second  respondent  interviewed 
using  the  secret  c ucstionnaire.  A comparison  of  the  two  sets  of  data  in 
1956  showed  a 60  per  cent  lower  undecided  in  the  secret  survey  and  a 
relatively  greater  vote  for  the  Democratic  ticket  of  Stevenson  and 
Kefauver  than  was  true  in  the  nonsecrct  survey.  In  1058  the  secret 
survey  produced  30  per  cent  less  undecided  than  the  nonsecret  survey 
and  a relatively  lower  Democratic  vote.  An  application  of  the  secret* 
nonsecret  differentials  in  1956  lowered  the  Eisenhower-Nixon  per- 
centage of  the  vote  one  and  a half  percentage  points  in  the  non-South: 
in  1958  it  produced  a change  of  less  than  a percentage  point. 

Table  6 lecapitulates  the  final  estimates  that  were  obtained  ushg 
the  final  survey  data  in  the  non-South  and  utilizing  all  steps  in  tile 
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estimating  procedure,  including  the  secret  ballot  in  1956  and  1958. 
Actually,  in  1956  and  1958  the  difference  between  the  secret  and  non- 


TABLE  6 

Comparison  of  Election  Result's  and  Final  Survey  Results: 
Per  Cent  Republican  of  the  Democratic-Republican  Vote 


Election 

Suney 

Remits 

Etedlon 

Result 

Deviation 

1950 

52.4 

52.7 

-0.3 

1952 

55.3 

58.8 

-1.5 

1954 

50.0 

50.5 

-05 

1958 

60.8 

59.0 

+16 

1958 

48.3 

48.1 

+0.2 

Mean* 

05 

Range* 

0.2-1 .5 

•Disregarding  signs. 


secret  was  statistically  significant  only  in  the  case  ol  the  per  cent  unde- 
cided. Nonetheless,  the  differential  was  applied  to  the  party  vote  for 
the  final  estimate  in  both  years  and  improved  matters  in  1956. 
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CHAPTER  FOUR 


CLASS  NOTES 

The  concept  of  random  sampling  is  discussed  at  length  in  the  selection 
of  readings.  Our  purpose  here  is  to  provide  some  data  in  a convenient 
form  so  that  the  student  can  apply  what  he  has  learned- -and,  we  hope, 
in  the  process  learn  it  more  thoroughly. 

A table  of  random  numbers  consists  of  a large  number  of  digits 
jumbled  together  in  a chance  fashion.  (They  are  arranged  In  blocks  of 
five  merely  to  facilitate  reading,  in  Table  I,  taken  from  George  W. 
Snedecor,  Statistical  Methods  . ) There  are  50  rows  and  50 
columns,  each  numbered  from  00  to  49. 

Since  the  numbers  are  arranged  in  a random  fashion,  any  sequence 
of  numbers  can  be  considered  random.  a can  begin  at  the  upper  left 
corner  and  read  across,  or  read  down}  you  can  begin  at  any  randomly 
selected  point}  and  so  on. 

For  example,  to  draw  a random  sample  cf  60  cases  out  of  a list  of 

99  cases  numbered  one  to  99,  we  can  start  at  any  point — let  us  say  it  is 

row  11,  column  8.  We  then  start  by  taking  two  digit  numbers,  reading  from 

» 

left  to  right: 

04,  05,  33,  64,  71,  72,  64,  56,  90,  66,  33,  S6,  67,  92,  39,  and  so  on. 
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If  any  number  is  repeated,  we  simply  discard  it;  and  we  continue  the 
process  until  we  have  accumulated  the  50  cases  to  be  in  our  sample. 

We  have  included  a roster  of  households  in  a hypothetical  town, 
giving  the  address,  the  race  of  the  head  of  the  household,  the  number  of 
years  of  schooling  of  the  head  of  the  household,  and  the  annual  income  of 
the  head  of  the  household.  The  student  can  number  each  of  the  addresses 
in  the  town,  and  then  select  a random  sample  of  cases — 40,  let  us  say.  He 
can  then  calculate  the  characteristics  of  the  households  in  this  sample 
(race,  schooling,  and  income)  and  compare  these  to  the  distribution  of 
characteristics  in  the  town  as  a whole  (the  data  are  given  at  the  end  of 
the  roster  of  households). 

This  use  of  a table  of  random  numbers  depends,  of  course,  or.  having 
a convenient  list  of  all  the  units  in  the  population  to  be  sampled.  In  many 
instances,  unfortunately,  such  a list  does  not  exist  or  would  be  too  costly, 
or  too  time-consuming  to  construct.  In  this  event,  we  must  find  other  means 
to  select  a random  sample. 

One  such  method  can  be  illustrated  by  using  Figure  1.  We  can  assume 
for  present  purposes  thatjio  accurate  list  of  all  the  households  in  the 
town  is  available.  Our  procedure  then  might  be  as  follows: 

(a)  We  can  first  select  a random  sample  of  blocks — numbering  each  block, 
and  using  our  table  of  random  numbers  to  select  a sample  of  blocks. 


(b)  We  can  then  pick  a starting  point  in  each  block  at  random.  (If  we 
start  at  the  corner  of  each  block,  we  may  bias  our  sample.  Corner 
lots  tend  to  be  more  expensive. ) 

(c)  We  can  then  select  every  other  house  in  the  block,  every  third  house, 
etc. , depending  on  the  size  of  the  sample  desired, 

NOTE : We  might  try  to  list  all  the  households  in  each  block  in  our  sample 
of  blocks,  assign  a number  to  each  household,  and  then  select  a random 
sample.  This  would  be  preferable,  but  takes  a good  deal  more  time. 

What  happens  as  we  decrease  the  number  of  blocks  in  our  sample  of 
blocks  and  increase  the  number  of  households  selected  in  each  block,  to 
get  the  same  sample  size  ? 

What  happens  as  we  increase  the  number  of  blocks  in  our  sample  of 
blocks  and  decrease  the  number  of  households  selected  in  each  block,  to  get 
the  same  sample  size? 

What  happens  if  each  block  in  the  town  has  an  equal  chance  of  appear- 
ing in  our  random  sample  of  ^’ocks  and  if  we  take  the  same  number  of 
households  from  each  block  in  the  sample?  Will  each  household  in  the 
population  of  households  have  an  equal  chance  of  appearing  in  the  final 
sa  mple  ? 
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TABLE  1.2 

Ten  Thousand  Randomly  Assorted  Digits 


00-04 

05-09 

10-14 

15-19 

20-24 

25-29 

30-34 

35-39 

40-44 

45-49 

CO 

54463 

22662 

65905 

70639 

79365 

67382 

2 'OS  5 

69331 

47058 

08186 

01 

15389 

85205 

18350 

*39226 

42249 

90669 

90325 

23248 

60933 

26927 

02 

85941 

40756 

82414 

02015 

13858 

78030 

16269 

65978 

OI3S5 

15345 

03 

61149 

69440 

11 286 

8S21S 

5S925 

03638 

52862 

62733 

33451 

77455 

04 

05219 

81619 

10651 

67079 

92511 

59688 

84502 

72095 

83463 

75577 

05 

41417 

98326 

87719 

92294 

46614 

50948 

6488  6 

20002 

97365 

30976 

06 

28357 

94070 

20652 

35774 

16249 

75019 

21145 

05217 

47286 

76305 

07 

17783 

00015 

10S06 

85091 

91530 

36466 

39981 

62431 

49177 

75779 

08 

40950 

84820 

29831 

85966 

62300 

70326 

$4740 

62660 

77379 

90279 

09 

82995 

64157 

66164 

41180 

10089 

41757 

78258 

96488 

8S629 

37231 

10 

96754 

17676 

55659 

44105 

47361 

3*1833 

$6679 

23930 

53249 

27033 

11 

34357 

83040 

53364 

71726 

45690 

66334 

60332 

22551 

90600 

71113 

12 

06318 

37403 

49927 

57715 

50423 

673?2 

63116 

4SS88 

21505 

80182 

13 

62111 

52820 

07243 

79931 

89292 

84767 

85693 

73947 

22278 

11S51 

14 

47534 

09243 

67879 

00544 

23410 

12740 

02540 

54440 

32949 

13491 

15 

9S614 

75993 

84460 

‘ 62846 

59844 

14972 

4S730 

73443 

48167 

34770 

16 

24856 

03648 

44S98 

09351 

98795 

18644 

39765 

71058 

90363 

4410* 

17 

96887 

12479 

80621 

66223 

S6083 

7S285 

02432 

53342 

42346 

94771 

18 

90801 

21472 

42815 

77403 

37390 ' 

76760 

52615 

32141 

30268  ’ 

18106 

19 

55165 

77312 

83666 

36028 

28420 

70219 

81369 

41943 

47366 

41067 

20 

75884 

12952 

84318 

95108 

72303 

64620 

91318 

89872 

45375 

8S436 

21 

16777 

37116 

58550 

42953 

21460 

43910 

01175 

87894 

81378 

10620 

22, 

46230 

43877 

80207 

6SS77 

893-80 

32992 

91380 

03164 

9S656 

59337 

23 

42902 

66S92 

46134 

01432 

94710 

23474 

20423 

60137 

60609 

131 19 

24 

81007 

00333 

39693 

2S039 

10154 

9 542 r , 

t 

39220 

19774 

31782 

49037 

25 

68089 

01122 

51111 

72373 

06902 

74373 

96199 

97017 

41273 

21546 

26 

20411 

67031 

89950 

16914 

93054 

67687 

96693 

87236 

77054 

33848 

27 

58212 

13160 

06468 

15718 

82627 

76999 

05999 

5S680 

96739 

63700 

28 

70577 

42866 

24969 

61210 

76045 

67699 

42054 

12696 

93758 

03283 

29 

94522 

74358 

71659 

62038 

79643 

79169 

44741 

05437 

39038 

13163 

30 

42626 

£6$  19 

85651 

8S678 

17401 

03252 

99547 

32404 

17918 

62680' 

31 

16051 

33763 

57194 

16752 

54450 

19031 

58580 

47629 

54.32 

60631 

32 

08244 

27647 

33851 

44703 

9421  1 

467!6 

11738 

557S4 

95374 

72655* 

33 

59497 

04392 

09419 

89964 

51211 

04894 

72882 

17S05 

21896 

83864 

34 

97155 

1 3428 

40293 

09985 

58434 

01412 

69124 

82171 

59058 

S2SS9 

35 

98409 

66162 

95763 

47420 

20792 

61527 

20141 

39435 

11859 

41567 

36 

45476 

84SS2 

65109 

96597 

25930 

66790 

65706 

61203 

53534 

22557 

37 

89300 

69700 

50741 

30329 

11658 

23166 

05400 

6C669 

4S708 

03S87 

38 

50051 

95137 

91631 

66315 

91428 

12275 

24816 

68091 

71710 

33258 

39 

31753 

85178 

31310 

89642 

98364 

02306 

24617 

09609 

83942 

22716 

40 

79152 

53829 

77250 

20190 

56535 

16760 

69942 

77448 

33278 

48805 

41 

44560 

38750 

83635 

56540 

64900 

42912 

13933 

79149 

18710 

6S6I8 

42 

68328 

83378 

63369 

71331 

39564 

05615 

42451 

64559 

97501 

65747 

43 

46939 

38689 

58625 

03342 

30459 

$5863 

20781 

09234 

26333 

91777 

44 

83544 

86141 

15707 

9C256 

23068 

13782 

0S467 

894  69 

93842 

55349 

45 

91621 

00881 

04900 

54224 

46177 

55309 

17852 

27491 

89415 

23466 

46 

91896 

67126 

04151 

03795 

59077 

11848 

12630 

93375 

52068 

60142 

47 

55751 

62515 

2110$ 

$0830 

02263 

29303 

37204 

96926 

30506 

09808 

48 

85156 

87639 

95493 

88842 

00664 

55017 

55539 

17771 

694  ,8 

87530 

49 

07521 

56S98 

12236 

60277 

39102 

62315 

12239 

07105 

11844 

01117 

ROSTER  OF  CI7'Y  HOUSEHOLDS 


LOCATION 

RACE  OF  HEAD 

NUMBER  OF 

■N 

ANNUAL  INCOME 

OF  HOUSEHOLD 
(Male) 

YEARS  OF 
SCHOOL  OF 
H OF  H 

OF  Ii  OF  H 

First  Avenue 
3001 

White 

3003 

White 

°005 

Negro 

3007 

White 

3009 

White 

3011 

White 

3015 

White 

3017 

White 

3019 

White 

3021 

White 

3023 

White 

3025 

White 

3027 

White 

3029 

White 

3031 

White 

3033 

White 

3035 

White 

3039 

White 

17 

$ 

15,000 

18 

$ 

16,000 

18 

$ 

14,000 

17 

$ 

15,000 

18 

$ 

14, 000 

18 

$ 

14,000 

18 

$ 

15,000 

17 

$ 

14, 000 

18 

$ 

14, 000 

18 

$ 

13,000 

18 

$ 

12, 000 

18 

$ 

13, 000 

16 

$ 

8,000 

14 

$ 

8,000 

15 

$ 

8,000 

14 

$ 

7,000 

9 

$ 

7,000 

9 

$ 

7,000 

1 


LOCATION 

RACE  OF  HEAD 
OF  HOUSEHOLD 
(Male) 

NUMBER  OF 
YEARS  OF 
SCHOOL  OF 
H OF  H 

ANNUAL  INCOME 
OF  H OF  H 

Second  Avenue 
3000 

White 

18 

S 16,000 

3001 

White 

17 

$ 15,000 

3003 

White 

18 

$ 16,000 

3004 

White 

8 

$ 11,000 

3005 

White 

17 

$ 16,000 

3006 

White 

10 

$ 15,000 

3007 

White 

11 

$ 14,000 

3008 

Negro 

17 

$ 12,000 

3009 

White 

12 

$ 11,000 

3010 

White 

12 

$ 11,000 

3011 

White 

17 

$ 16,000 

3012 

White 

13 

$ 11,000 

3013 

White 

14 

$ 9,000 

3014 

Negro 

11 

$ 10,000 

3015 

White 

15 

$ 9,000 

3016 

White 

10 

$ 8,000 

3017 

White 

11 

$ 8,000 

3018 

White 

12 

$ 8,000 

3020 

White 

13 

$ 8,000 

3021 

White 

12 

$ 7,000 

3022 

White 

12 

$ 7,000 

3023 

White 

11 

$ 6,000 

o 


2 


cona  a\ 

3024 

3025 

3027 

3028 

3029 

3030 

3031 

3032 

3033 

3034 

3035 

3036 

3039 

3040 

O 

ERLC 


RACE  OF  HEAD 
OF  HOUSEHOLD 
(Male) 


NUMBER  OF 
YEARS  OF 
SCHOOL  OF 
II  OF  H 


ANNUAL  INCOME 
OF  H OF  H 


White 

11 

$ 6,000 

White 

11 

$ 6,000 

White 

12 

$ 6,000 

White 

8 

$ 5,000 

Negro 

7 

$ 8,000 

Negro 

8 

$ 5,000 

Negro 

9 

$ 6, COO 

Negro 

10 

$ 6,000 

Negro 

10 

$ 6,000 

Negro 

11 

$ 5,000 

Negro 

12 

$ 5,000 

Negro 

9 

$ 6,000 

Negro 

9 

$ 5,000 

Negro 

12 

$ 8,000 

3 


LOCATION 


RACE  OF  HEAD  NUMBER  OF  ANNUAL  INCOME 

OF  HOUSEHOLD  YEARS  OF  OF  H OF  H 

(Male)  SCHOOL  OF 

II  OF  H 


Third  Avenue 


3000 

White 

3001 

White 

3003 

White 

3004 

Negro 

3005 

White 

3006 

White 

3007 

White 

3008 

White 

3009 

White 

3010 

White 

3011 

White 

3012 

White 

3013 

White 

3014 

White 

3015 

White 

3016 

White 

3017 

White 

3018 

Whito 

3020 

Negro 

3021 

White 

3022 

White 

3023 

Whito 

O 

ERIC 

17 

$ 16,000 

17 

$ 16,000 

10 

$ 13,000 

16 

$ 11,000 

11 

$ 

8,000 

13 

$ 12,000 

11 

$ 

9,000 

12 

$ 

9,000 

13 

$ 

8,000 

11 

$ 

9,000 

11 

$ 

9,000 

11 

3 

7,000 

12 

$ 

9,000 

10 

$ 

6,000 

13 

$ 

9,000 

14 

$ 

9,000 

15 

3 

6,000 

10 

$ 

6,000 

13 

$ 

8,000 

8 

$ 

6,000 

9 

$ 

6,000 

8 

$ 

6,000 

4 


OCATIC 

'hird  Av 

3024 

3025 

3027 

3028 

3029 

3030 

3031 

3032 

3033 

3034 

3035 

3036 

3039 

3040 

O 

ERIC 


IiA'CE  of  head 

OF  HOUSEHOLD 
(Male) 

White 

White 

White 

White 

White 

White 

White 

White 

White 

White 

White 

White 

White 

White 


NUMBER  OF 
YEARS  OF 
SCHOOL  OF 
H OF  H 

9 

10 
11 
11 
11 
12 
13 
13 

9 

8 

7 

9 

7 

7 


ANNUAL  INCOME 
OF  H OF  H 

$ 5,000 
$ 5,000 
$ 5, COO 
$ 5,000 
$ 4,000 
$ 4,000 
$ 4,000 
$ 3,000 
$ 3,000 
$ 3,000 
$ 2,000 
$ 2,000 

$ 

$ 3,000 
$ 3,000 


5 


IU.A  J.  J 

ur'h  j 

3000 

3001 

3003 

3004 

3005 

3000 

3007 

3008 

3009 

3010 

3011 

3012 

3013 

3014 

3015 

3016 

3017 

3018 

3020 

3021 

3022 

3023 

O 

ERL' 


RACE  OF  HEAD 
OF  HOUSEHOLD 
(Male) 

NUMBER  OF 
YEARS  OF 
SCHOOL  OF 
H OF  H 

ANNUAL  INCOME 
OF  H OF  H 

White 

13 

$ 7,000 

White 

’8 

$ 7,000 

White 

14 

$ 7,000 

While 

10 

$ 6,000 

White 

11 

$ 7,000 

White 

12 

$ 7,000 

White 

10 

$ 7,000 

White 

10 

$ 7,000 

White 

9 

$ 6,000 

White 

10 

$ 6,^ 

White 

10 

$ 7,000 

White 

10 

$ 7,000 

White 

10 

$ 6,000 

White 

11 

$ 7,000 

White 

10 

$ 6,000 

White 

9 

$ 5,000 

White 

10 

$ 5,000 

White 

9 

$ 4, 0C0 

Negro 

15 

$ 9,000 

Negro 

9 

$ 4,000 

Negro 

10 

$ 9,000 

Negro 

11 

$ 7,000 

c 


LOCATION 

RACE  OF  HEAD 
OF  HOUSEHOLD 
(Male) 

NUMBER  OF 
YEARS  OF 
SCHOOL  OF 
H OF  II 

ANNUAL  INCOME 
OF  II  OF  II 

Fourth  Avenue 
3024 

Negro 

11 

$ 7,000 

3025 

Negro 

11 

$ 7,000 

3027 

Negro 

S 

$ 4,000 

3028 

Negro 

9 

$ 6,000 

3029 

Negro 

8 

$ 4,000 

3030 

Negro 

10 

$ 6,000 

3031 

Negro 

8 

$ 4,000 

3032 

Negro 

8 

$ 4,000 

3033 

Negro 

7 

$ 3,000 

3034 

Negro 

6 

$ 2,000 

303. 

Negro 

6 

$ 2,000 

3036 

Negro 

7 

$ 3,000 

8039 

Negro 

m 

1 

$ 3,000 

3040 

Negro 

6 

$ 2,000 

7 

O 

ERIC 


LOCATION 

RACE  OF  HEAD 
OF  HOUSEHOLD 
(Male) 

Washington  Street 

101 

White 

102 

White 

103 

White 

104 

White 

105 

White 

100 

White 

201 

White 

202 

White 

203 

While 

201 

White 

205 

While 

206 

White 

301 

White 

302 

White 

303 

White 

304 

White 

305 

White 

306 

White 

401 

White 

402 

White 

403 

White 

404 

White 

405 

O 

White 

ERIC 

White 

NUMBER  OF  ANNUAL  INCOME 

YEARS  OF  OF  H OF  H 

SCHOOL  OF 
II  OF  II 


16 

17 

17 

10 

iG 

15 

IG 

14 

17 

14 

15 
IG 
15 

11 
H 

12 
II 
10 
11 
11 
10 
10 
10 
10 


$ 15,000 
$ 14,000 
$ 14,000 
$ 13,000 
$ 13,000 
$ 13,000 
$ 15,000 
$ 11,000 
$ 16,000 
$ 15,000 
$ 12,000 
$ 11,000 
$ 10,000 
$ 7,000 
$ 9,000 
$ 7,000 
$ 9,000 
$ 7,000 
$ 7,000 
$ 6,000 
$ 7,000 
$ 6,000 
$ 6,000 
$ 5,000 


8 


101 

102 

103 

104 

105 

106 

201 

202 

203 

204 

205 

206 

301 

302 

303 

304 

305 

306 

401 

402 

403 

404 

405 

406 

O 

KL( 


RACE  OF  HEAD 
OF  HOUSEHOLD 
(Male) 

White 

White 

White 

White 

White 

White 

Whito 

White 

White 

White 

White 

White 

White 

White 

White 

White 

White 

Negro 

White 

White 

White 

White 

White 

White 


NUMBER  OF 
YEARS  OF 
SCHOOL  OF 
H OF  H 
11 

12 

13 

14 
II 
II 

11 
13 

13 
It 

12 
11 

11 
11 
11 
11 
11 

14 
11 

12 
12 
12 
II 
11 


ANNUAL  INCOME 
OF  H OF  II 

$ 13,000 
$ 13,000 
$ 12,000 
$ 12,000 
$ 11,000 
$ 10,000 
$ 10,000 
$ 10,000 
$ 10,000 
$ 8,000 
$ 10,000 
$ 7,000 
* 7,000 
$ 6,000 
$ 6,000 
$ 7,000 
$ 6,000 
$ 8,000 
$ 6,000 
$ 5,000 
$ 5,000 
$ 6,000 
$ 5,000 
$ 5,000 


9 


LOCATION 

Jefferson  Street 
101 

102 

103 

104 

105 
10G 
201 
202 

203 

204 

205 
20G 

301 

302 

303 

304 

305 

306 

401 

402 

403 

404 

405 

406  ; 

O 

ERIC 


RACE  OF  HEAD 
OF  HOUSEHOLD 
(Male) 

White 

White 

White 

White 

White 

White 

White 

White 

White 

White 

White 

White 

White 

White 

White 

White 

White 

White 

Negro 

Negro 

Negro 

White 

White 


NUMBER  OF 
YEARS  OF 
SCHOOL  OF 
II  OF  H 


ANNUAL  INCOME 
OF  H OF  H 


1G 

$ 12,000 

15 

$ 11,000 

!6 

$ 

8,000 

13 

$ 10,000 

11 

$ 

G,  000 

14 

c> 

10,000 

11 

$ 

7,000 

11 

$ 

G,  000 

11 

$ 

G,  000 

11 

$ 

5,000 

II 

$ 

6,000 

12 

3 

7,000 

12 

$ 

5,000 

13 

$ 

5,000 

12 

$ 

5.000 

11 

$ 

4,000 

12 

$ 

4,000 

12 

$ 

4,000 

10 

$ 

G,  000 

11 

$ 

6,000 

12 

$ 

6,000 

12 

$ 

4,000 

12 

$ 

4,000 

10 

s 

6,000 

Negro 
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CAT 

r.nt  S 

101 

102 

103 

104 

105 

10G 

201 

202 

203 

204 

205 

20G 

301 

302 

303 

304 

305 

30G 

401 

402 

403 

404 

405 

406 
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RACE  OF  HEAD 
OF  HOUSEHOLD 
(Male) 

White 

White 

White 

White 

White 

White 

White 

White 

White 

White 

White 

White 

White 

White 

White 

White 

Negro 

White 

Negro 

Negro 

Negro 

Negro 

Negro 

Negro 


NUMBER  OF 
YEARS  OF 
SCHOOL  OF 
II  OF  II 
12 

14 
12 

13 

15 

15 

15 

15 

14 
13 

13 

14 
13 
13 
13 
13 
12 
11 
10 
12 
12 

9 

7 

8 


ANNUAL  INCOME 
OF  II  OF  II 

$ 12,000 
$ 7,000 
$ 10,000 
$ 7,000 
$ G, 000 
$ 6,000 
S 5,000 
$ G, 000 
$ 5,000 
$ 3,000 
$ 5,000 
$ 5,000 
$ 4,000 
$ 4,000 
$ 4,000 
$ 3,000 
$ 4,000 
$ 7 , 000 
$ 5,000 
$ 4,000 
$ 4,000 
$ 4,000 
$ 4,000 
$ 3,000 


II 


LOCATION 
Harding  Street 

RACE  OF  HEAD 
OF  HOUSEHOLD 
(Male) 

NUMBER  OF 
YEARS  OF 
SCHOOL  OF 
H OF  II 

ANNUAL  INCOME 
OF  H OF  H 

101 

White 

14. 

$ 

7,000 

102 

White 

14 

$ 

7,000 

1C3 

White 

14 

$ 

6,000 

104 

White 

1G 

$ 

G,  000 

105 

White 

14 

$ 

5,000 

I0G 

White 

14 

$ 

6,000 

201 

White 

14 

$ 

5,000 

202 

White 

14 

$ 

5,000 

203 

White 

15 

$ 

5,000 

204 

White 

12 

$ 

4,000 

205 

White 

9 

$ 

4,000 

206 

White 

6 

$ 

4,000 

301 

Negro 

8 

$ 

G,  000 

302 

Negro 

G 

$ 

3,000 

303 

White 

8 

$ 

5,000 

304 

Negro 

G 

$ 

4,000 

305 

Negro 

8 

$ 

5,000 

30G 

Negro 

7 

$ 

4,000 

401 

Negro 

6 

$ 

3,000 

402 

Negro 

7 

$ 

2,000 

403 

Negro 

8 

$ 

2,000 

404 

Negro 

8 

$ 

3,000 

405 

Negro 

G 

$ 

3,000 

406 

0 

ERIC 

Negro 

6 

12 

$ 

2,000 

Total  Income 

$ 1,838,000 

Total  Number  of  Households 

24  G 

Per  Cent  Negro  Households 

22.4 

Mean  Income 

$ 7,470 

Mean  Years  of  Schooling 

10,7 

13 


CHAPTER  V 


THE  PROBLEM  OF  INFERENCE 

1 

In  the  previous  chapter  it  was  pointed  out  that  wo  are  frequently 
faced  with  the  necessity  of  making  estimates  or  statements  based  on  a 
sample  drawn  from  a larger  population  or  universe.  It  is  the  technique 
of  making  these  estimates!  commonly  called  the  process  of  inference, 
which  now  concerns  us. 

For  some  law  students,  the  concepts  and  the  notational  system  and 
the  computations  we  will  be  dealing  with  will  be  perfectly  familiar.  They 
"*,e,  in  fact,  relatively  simple  and  straight  forward  and  they  require  no 
more  mathematical  background  than  the  ability  to  add,  subtract,  multiply, 
and  divide. 

Before  we  lum  to  the  process  of  statistical  Inference,  however,  it 
is  necessary  for  the  student  to  understand  a number  of  basic  ideas  such 
as  mean,  variance,  normal  distribution,  and  so  on.  Any  one  of  the  books 
on  elementary  statistics  listed  in  Appendix  A will  provide  a clear  and 
concise  explanation. 
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Two  of  these  ideas — that  of  the  standard  deviation  and  the  normal 
curve— are  really  essential  for  understanding  the  process  of  statistical 
inference}  and  it  is  this  process  which  is  discussed  by  John  Mueller  and 
Karl  Schuessler  in  the  selection  from  Statistical  Reasoning  in 
Sociology. 

The  most  crucial  idea  in  this  selection  by  Mueller  and  Schuessler 
involves  the  null  hypothec  is.  This  is  an  idea  that  will,  in  all  probability, 
provide  some  difficulty  to  the  law  student— and  for  most  students,  for  that 
matter,  if  our  experience  is  any  guide.  The  law  student,  however,  has  one 
advantage,  for  the  concept  of  the  null  hypothesis  will  probably  be  familiar  to 
him  in  another  guise — namely,  the  judgment  of  guilty  beyond  a reasonable 
doubt  as  it  appears  in  criminal  trials. 

Now  many  legal  writers  are  agreed  that  the  precise  meaning  of  "beyond 
a reasonable  doubt"  is  far  from  clear  and  that  it  is  a notion  which  has  con- 
fused many  a Jury,  liut  one  way  to  look  at  it,  we  think,  is  along  the  follow- 
ing line  of  reasoning.  The  defendant  in  a criminal  trial  is  presumed  "innocent 
until  proven  guilty.  " The  prosecution  then  presents  a body  of  evidence  show- 
ing that  the  defendant  Is  guilty  of  the  crime  with  which  he  Is  charged— a body  of 
evidence  some  part  of  which  still  stands  after  the  defense  attorney  has  held  it 
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up  for  criticism  and  presented  his  countervailing  facts.  Now  the  Jury  must 
ask  itself:  What  are  the  chances  that  this  array  of  evidence  pointing  to  guilt 
could  have  arisen  if  in  fact  the  defendant  is  innocent? 

If  it  appears  that  the  defr  lant  could  bo  innocent  and  yet  there  is  still 
a good  chance  that  such  an  array  of  evidence  pointing  to  guilt  could  arise 
fairly  often  (50  out  of  100  times,  let  us  say)  the  jury  may  bo  very  reluctant 
to  bring  in  a verdict  of  guilty— the  jury  would  rather  cling  to  the  assumption 
of  innocence.  Indeed,  If  such  an  array  of  evidence  pointing  to  guilt  would 
arise  relatively  rarely  (only  20  out  of  100  times,  let  us  say)  when  the  de- 
fendant was  innocent,  the  jury  still  may  not  feel  it  has  enough  grounds  to 
convict,  i.e. , reject  the  assumption  of  innocence.  But  if  the  chances  are 
very  slim  indeed  that  an  inr.ocent  man  would  every  be  confronted  with  such 
proof  of  guilt  (onco  out  of  100  times,  perhaps)  the  Jury  may  feel  that  it  must 
reject  Its  assumption  of  innocence  and  declare  the  defendant  a criminal.  The 
members  of  the  Jury  have  been  convinced,  we  say,  beyond  a reasonable  doubt. 

In  criminal  trials,  of  course,  there  is  almost  never  an  opportunity 
to  attach  any  precise  figure  to  the  chances  of  a body  of  evidence  as  a whole 
pointing  to  guilt  occurring  and  the  jury  is  forced  to  rely  on  more  impression- 
istic evaluations.  Nonetheless,  the  logical  process  of  making  a judgment 
about  guilt  in  a criminal  trial  and  testing  a null  hypothesis  in  statistics  seem 
to  show  many  similarities. 
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The  last  selection  In  this  chapter  deals  with  the  application  of  sampling 
theory  and  statistical  inference  to  the  problem  of  discrimination  in  the 
selection  of  Juries.  It  represents,  we  think  an  outstanding  illustration  of 
how  the  disciplines  of  law  and  the  social  sciences  can  and  must  work  on 
certain  problems  arising  in  a large,  complex  society.  Certainly  che  social 
sciences  by  themselves  would  find  it  difficult  to  see  exactlv  ) ow  their  know- 
ledge of  quantitative  methods  and  patterns  of  discrimination  could  be  made 
most  relevant  for  the  kgal  disputes  about  the  composition  of  the  Jury.  And 
there  is  little  doubt  but  that  the  law  alone  would  find  problems  in  seeing 
both  the  possibilities  and  the  limitations  of  ".01"  levels,  etc.  When  they 
Join  forces,  however,  they  can  exert  great  leverage  in  the  maintenance  of 
individual  rights  and  the  use  of  law  for  social  ends.  And  the  possibility  of 
fruitful  cooperation  Justifies,  in  our  opinion,  the  exposure  of  the  law  student 
to  materials  which  are  likely  to  seem  somewhat  outside  his  regular  course  of 
study. 


. 4 . 


STATISTICAL  BACKGROUND 


Sampling,  as  we  have  seen,  Is  a useful  technique  for  obtain- 
ing information  about  the  characteristics  of  a universe.  The  informa- 
tion derived  from  a sample,  however,  can  only  give  an  estimate  of  the 
characteristics  present  in  the  universe.  The  fact  that  a random  pro- 
cedure is  used  in  drawing  a sample  helps  to  insure  that  the  sample  is  not 
subject  to  systematic  bias,  but  the  results  of  any  given  sample  are,  In 
all  probability,  In  error  to  some  degree.  That  is,  sampling  does  not 
give  an  exact  pictui'e  of  the  characteristics  under  observation.  For 
example,  Mueller  and  Schucssler,  in  the  following  reading,  present 
a universe  of  suicide  rates  in  107  American  cities;  the  mean  suicide 
rate  of  all  the  cities  Is  12.4.  They  then  draw  100  random  samples  from 
the  universe,  four  of  which  coincide  with  the  true  meant  and  "approxi- 
mately two-thirds  of  the  100  means  fall  between  P.  4 and  13. 3j  or 
virtually  within  one  point  of  the  true  mean.  " What  Is  significant  here 
is  that  sampling  provides  us  with  a fairly  accurate  estimation  of  the 
parameter,  yet  a large  proportion  of  samples  are  in  error.  It  is  the 
procedure  for  dealing  with  such  sampling  errors  that  is  discussed  by 
Mueller  and  Schuesster. 


O 

ERLC 


l 


From  John  H.  Mueller  and  Karl  F.  Schuessler, 
Statistical  Reasoning  in  Sociology , 
(Boston:  Houghton  Mifflin  Company,  1961) 
pp.  361-379;  388-391.  Reprinted  by  permission. 
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John  H.  Mueller  and  Karl  F.  Schuessler,  Statistical 
Reasoning,  pp  361-379;  388-391. 


Section  O ;ve 

Parameter  Estimation 


Pre&.kms  in  Estimation*  It  should  bo  unnecessary  to  repeat  that  a sample 
is  useful  only  for  the  information  it  supplies  on  the  characteristics  of  the 
universe.  Thus,  the  sampling  process  finds  its  ultimate  consummation  in 
a description  of  the  population  from  which  the  sample  is  drawn*  But  this 
description  can  be  only  an  approximation:  an  average  monthly  expenditure 
of  $92.75  in  a random  sample  of  college  students  will  not  correspond  exactly 
to  the  average  expenditure  of  the  whole  student  body  which  the  sample 
is  designed  to  represent.  Nor,  obvioush  will  a second  sample  of  $$$.62 
necessarily  correspond  more  closely  to  the  unknown  parameter.  If,  there- 
fore, owing  to  the  vagaries  of  chance  sampling,  no  two  sampler  arc  alike 
and  all  arc  in  error,  with  how*  much  confidence  can  vre  speak  of  the  value 
of  the  universe?  Clearly,  we  cannot  merely  project  the  value  of  the  sample 
onto  the  universe,  and  lot  it  go  at  that.  This  procedure  of  adopting  the 
sample  value  in  lieu  of  the  parameter,  or  universe  value,  is  hedged  about 
with  as  many  regulations  as  is  the  procedure  of  sampling  itself.  We  call 
this  set  of  prescribed  procedures  statistical  inference. 

Since,  in  the  practical  affairs  of  life,  it  is  so  rare  that  we  are  able  to  exam- 
ine a whole  universe,  and  since  sampling  is  so  frequently  the  resort,  the 
procedures  of  statistical  inference  loom  quite  large  in  the  repertory  of 
statisticians.  In  fact,  some  would  actually  identify  the  science  of  statistics 
with  the  problem  of  sampling  and  decision-making  — an  extreme  emphasis 
which  seems  to  the  authors  unrealistic,  since  descriptive  statistics  have  a 
validity  and  importance  in  their  own  right. 

The  problems  of  statistical  inference  begin  with  the  legitimate  assump- 
tion of  a discrepancy  between  the  variable  simple  estimate  and  the  uni- 
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verse  parameter,  which  is  constant.  When  this  discrepancy  is  the  result 
of  random  sampling,  we  call  it  a sampling  error . If  the  average  monthly 
expenditure  of  all  students  is  $95.50  and  the  sample  mean  is  $92.75,  the 
sampling  error  is  $2.75.  In  general,  the  sample  rarely  provides  us  with  a 
perfect  description  of  the  universe,  for  the  reason  that  few  if  any  sample 
values  are  entirely  free  of  sampling  error. 

Sampling  Distribution*  Even  though  such  errors  cling  to  almost  every 
sample  statistic,  there  is  no  reason  to  he  dismayed  by  such  a natural  con- 
sequence of  random  sampling.  These  errors,  when  taken  in  the  mass,  do 
not  behave  chaotically;  on  the  contrary,  they  behave  systematically,  they 
exhibit  a characteristic  distribution,  and  tlu.y  are  therefore  conveniently 
amenable  to  analysis.  There  is  not  only  a definite  zone  surrounding  the 
parent  value  beyond  which  the  variable  samples  cannot  stray,  but  in 
addition  the  variety  of  sample  values  distribute  themselves  in  a recogniz- 
able pattern  which  permits  statistical  inferences  to  be  confidently  drawn. 
We  can,  for  example,  bo  certain  that  the  average  age  of  a sample  of  high 
school  students  will  not  fall  below  10,  or  exceed  21 ; and  reasonably  certain 
that  it  will  not  fall  short  of  14  or  surpass  17.  We  can  assert  this  because 
we  are  so  thoroughly  familiar  with  the  ages  of  high  school  students.  But 
whether  or  not  wc  are  intimately  familiar  with  the  universe,  we  can  still 
be  assured  that  the  sample  value  v. ill  always  bear  some  resemblance  to 
the  corresponding  universe  value.  We  can  proceed  with  confidence  that  the 
statistic  will  always  adhere,  to  a greater  or  lesser  extent,  to  its  parameter. 

To  bring  this  very  important  fact  home  to  us,  we  shall  illustratively 
take  a known  universe,  extract  a large  number  of  samples  from  it,  calculate 
the  mean  for  each,  and  then  compare  these  means  with  the  population 
mean.  Wc  will  then  have  demonstrated  to  what  extent  the  sample  means 
actually  digress  from,  or  converge  on,  the  parameter  itself.  Of  course,  in 
an  actual  problem,  we  will  not  have  knowledge  of  the  universe  mean,  nor 
wall  wc  take  a large  number  of  samples.  Instead  we  will  be  ignorant  of  the 
true  mean  and  wilt  take  only  one  sample,  which  we  will  have  to  make  do. 
But  with  the  background  afforded  by  this  experiment,  wc  wall  be  fortified 
with  knowledge  of  how  samples  in  general  behave,  all  of  which  will  help 
us  to  infer  how  the  particular  sample,  which  we  have  drawn,  Ls  probably 
behaving. 

For  our  working  universe,  we  shall  take  the  previously  analyzed  suicide 
rates  of  107  American  cities  (Table  3.1.1a).  From  this  universe  we  shall 
(1)  draw  100  random  samples  — restoring  each  sample  to  the  universe 
before  drawing  another  — and  (2)  compute  the  mean  of  each  sample. 
This  experiment  ought  to  be  sufficient  to  reveal  something  about  the 
regularity  in  the  behavior  of  the  seemingly  wayward  samples.  These 
experimental  means  are  arrayed  in  Table  12.1.1a  and  grouped  hi  Table 
12.1.1b. 
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Tabic  12.1.1a  Array  of  100  Sample  Means,  n » SO 


9.9 

11.6 

12.2 

12.7 

13.1 

10.0 

11.6 

12.2 

12.7 

13.2 

10.3 

11.7 

12.2 

12.8 

13.2 

10.3 

11.7 

12.2 

12.S 

13.2 

10.7 

11.8 

12.3 

12.8 

13.3 

10.S 

11.8  * 

12.4 

12. S 

13.3 

10.8 

11.8 

12.4 

12.9 

13.3 

10.9 

11.8 

12.4 

12.9 

13.3 

10.9 

11.9 

12.4 

12.9 

13.4 

11.0 

12.0 

12,5 

12.9 

13.5 

11.0 

12.0 

12.5 

12.9 

13.5 

11.1 

12.0 

12.5 

13.0 

13.5 

11.1 

12.0 

12.6 

13.0 

13.6 

11.1 

12.0 

12.6 

13.0 

13.7 

11.3 

12.1 

12.6 

13.0 

13.S 

11.4 

12.1 

12.6 

13.1 

13.9 

11.5 

12.1 

12.6 

13.1 

14.0 

11.5 

12.1 

12.6 

13.1 

14.0 

11.5 

12.2 

12.6 

13.1 

14.2 

11.0 

12.2 

12.6 

13.1 

14.3 

Frequency  Tally , Empirical  Sampling  Distribution, 
Table  12.1.1b  100  Sample  Means , n - 50 


Class 

Interval 

Talitv 

/ 

9.0-  9.9 

/ 

i 

10.0-10.9 

M III 

8 

11.0-11.9 

mmmm 

20 

12.0-12.9 

■mm  mm  m m m m n 

42 

13.0-13.9 

■m m mm  m 

25 

14.0-14.9 

llll 

4 

100 

Inspecting  this  lot,  we  sec  that  only  4 of  the  100  samples  coincide  exactly 
with  the  universe  mean,  known  to  be  12.4.  All  others  contain  a sampling 
error;  in  fact,  every  one  of  them  would  probably  have  shown  a sampling 
error  if  we  had  insisted  on  greater  decimal  accuracy.  We  can  see,  however, 
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(hat  the  universe  mean  is  snugly  nested  in  the  very  middle  of  the  whole 
unay  of  sample  means,  and  that  most  of  the  sample  means  are  compactly 
huddled  around  the  true  mean  of  12.4,  which  is  practically  at  the  50  per 
cent  division  point.  Approximately  two-thirds  of  the  100  means  fall 
between  11.4  and  13.3,  or  virtually  within  one  point  of  the  true  mean. 
Characteristically,  the  sample  menus  gravitate  toward  the  parent  mean, 
or  the  center  of  the  universe  distribution. 

It  now  becomes  evident  how  improbable  it  is  that  a sample  mean  Will 
deviate  seriously  from  the  universe  mean.  Furthermore,  not  only  do 
sample  means  cluster  around  the  true  mean,  but  their  pattern  of  distribu- 
tion takes  on  a shape  unmistakably  approaching  the  normal  curve  (Figure 
12.1,1).  If  we  had  drawn,  processed,  and  tabulated  all  * possible  samples, 


Fiquke  12,1.1  Histogram  oj  100  Sample  Means,  n = SO 


instead  of  merely  100  of  them,  we  would  have  a pattern  of  distribution 
which  would  display  still  greater  conformity  to  the  smooth  normal  curve. 
This  hypothetical  frequency  curve  of  all  possible  sample  means  is  labeled 
the  theoretical  sampling  distribution,  and  has,  as  its  mathematical  model, 
the  ideal  normal  curve.  Any  experimental  distribution  of  a limited  mim- 
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ber  of  samples  is  called  an  empirical  sampling  dislribviion.  The  theoretical 
sampling  distribution  would,  of  course,  be  attained  only  by  an  infinite 
number  of  samples,  since  it  would  require  that  many  to  assure  the  dis- 
tribution of  sample  means  in  their  proper,  expected  proportions.  The 
significant  procedural  point,  however,  in  this  type  of  problem,  is  that  we 
can  usually  treat  the  thcorcticad  sampling  distribution  of  means  as  if  it 
wore  a perfect,  smooth,  ideal  normal  curve. 

Distribution  of  Sampling  Errors . By  subtracting  the  universe  mean  from 
each  of  the  100  sample  menus,  we  obtain  the  100  sampling  errors  (Table 
12.1.2a),  which  of  course  have  the  same  curve  pattern  as  the  means  them- 
selves (Table  12.1. 2b).  All  we  have  done  is  move  the  zero  origin  to  the 
universe  mean  (12.4). 


Array  of  Sampling  Errors,  100 
Table  12.1.2a  Samples , n = SO 


-2.5 

-.3 

— .2 

+ .3 

+ .7 

2.4 

.8 

.2 

.3 

JS 

2.1 

.7 

.2 

.4 

.8 

2.1 

.7 

.2 

.4 

,8 

1.7 

.6 

.1 

.4 

.9 

1.6 

.6 

0 

.4 

.9 

1.6 

.6 

0 

.5 

.9 

1.5 

.6 

0 

.5 

.9 

1.5 

.5 

0 

.5 

1.0 

1.4 

.4 

+.1 

.5 

l.l 

1.4 

.4 

.1 

.5 

U 

1.3 

.4 

.1 

.6 

l.l 

1.3 

.4 

.2 

.6 

1 2 

1.3 

.4 

.2 

.6 

1.3 

1.1 

.3 

.2 

.6 

1.4 

1.0 

.3 

mo 

.7 

1.5 

.9 

.3 

.2 

.7 

1.5 

.9 

.3 

.2 

.7 

1.6 

.9 

.2 

.2 

.7 

1.3 

.8 

.2 

.2 

.7 

1.9 

The  distribution  of  sampling  errors  (Table  12.L2b)  again  demonstrates 
that  the  small  deviations  are  very  numerous;  the  larger  discrepancies  are 
few  in  number.  If  we  had  drawn  a single  sample  only,  wc  could  be  prac- 
tically certain  that  our  sample  mean  would  not  have  missed  the  true  mean 
by  more  than  two  points,  plus  or  minus;  indeed,  G8  per  cent  of  the  samples 
arc  in  error  by  less  than  plus  or  minus  one. 
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Table  12.1 .2b 

Frequency  Distribution  of  Sam- 
pling Errors 


Class  Interval 

i 

-3.4-(— 2.5) 

1 

— 2.4— C — 1.5) 

8 

— 1.4-  .5) 

20 

- A-  .5 

42 

.6-  1.5 

25 

1.6-  2.5 

_J 

100 

The  presence  of  sampling  errors  obviously  complicates  the  task  of  draw- 
ing an  inference  about  the  parameter;  but  the  neat  pattern  of  these  errors 
renders  the  uncertainties  of  such  inference  less  formidable.  Our  dissection 
of  the  empirical  sampling  distribution  has  given  us  considerable  reassur- 
ance of  the  reasonableness  of  the  behavior  of  samples,  which  will  permit 
us  to  estimate  the  degree  of  confidence  of  which  the  single  sample  is  worthy. 

Sample  Reliability.  \\ e must  now  take  up  the  technique  which  will  make 
appropriate  allowance  for  the  ever  present  sampling  error,  and  thereby 
enable  us  to  make  reliable  estimates  of  the  parameter.  For,  unless  we  can 
rely  with  a certain  degree  of  confidence  o; i our  inferences,  there  Is  no  point 
in  making  them  at  all.  The  entire  purpose  of  sampling  is  to  substitute  a 
sample  of  given  reliability  for  the  prohibitively  complete  enumeration  of 
the  unknown  universe. 

But  we  cannot  measure  the  sampling  error  of  our  single  sample  directly 
since  we  are  never  given  the  true  mean  from  which  to  compute  it.  That 
would  suppose  that  we  already  have  what  we  have  set  out  to  find  in  the 
first  place.  The  single  sample  must  ultimately  therefore  provide  two 
kinds  of  information,  as  can  be  deduced  from  the  foregoing  analysis: 
(1)  an  estimate  of  the  parameter,  and  (2)  the  degree  of  confidence  we  may 
place  in  that  estimate,  or  the  reliability  of  the  estimate  — in  our  example, 
the  reliability  of  the  mean. 

The  procedures  for  measuring  the  reliability  of  an  estimate  arc  numerous, 
since  they  differ  according  to  the  sample  statistic  (for  example,  whether 
mean  or  percentage)  and  according  to  the  make-up  of  the  sample  (whether 
large  or  small,  simple  or  .stratified).  Here,  we  present  only  two  of  the 
simplest  techniques  for  measuring  sample  reliability;  these  apply  to  means 
and  percentages  of  large  random  samples,  respectively.  They  will  suffice 
for  our  purposes,  since  other  techniques  produce  results  which  in  principle 
cany  the  same  interpretation  as  the  method  here  unfolded. 


Interval  Estimate  of  the  Mean.  Let  us  suppose  that,  lit  taking  a single 
sample  (n  = 30)  from  the  107  suicide  rates,  wc  obtain  a mean  of  13.5.  In 
such  a realistic  situation,  we  do  not  know  the  value  of  the  true  mean,  and 
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will  never  know  it.  At  this  stage  of  the  inquiry,  13.5  is  the  only  estimate 
we  have  of  the  true  mean,  and  therefore  it  is  our  “best”  estimate.  We 
call  this  a point  estimate.  No  one  would  claim  that  this  estimate  is  any 
more  than  approximately  correct.  The  weakness  of  the  point  estimate  is 
that  it  provides  no  indication  of  the  sampling  error,  end  consequently  we 
do  not  know  how  much  confidence  to  accord  it.  This  disadvantage  is 
overcome  by  setting  up  an  interval  which  probably  contains  the  true  mean, 
and  which  we  therefore  label  an  interval  estimate.  To  construct  such  an 
interval,  we  (1)  estimate  the  range  within  which  a high  proportion  of 
sampling  errors  lie,  and  (2)  add  and  subtract  this  range  of  error  to  and 
from  our  observed  sample  mean  to  form  the  interval  which  probably  en- 
closes the  true  mean. 

This  attack  on  the  problem  is  analogous  to  the  following  homely  situa- 
tion. Encountering  a man  on  the  street,  we  may  estimate  his  age  to  be 
35  (point  estimate).  Upon  being  challenged,  we  reply:  “Of  course,  this 
is  only  an  estimate  based  on  cues  that  I recognize.  I feel  reasonably  con- 
fident, however,  from  samples  of  men  I have  known  that  he  is  somewhere 
between  30  and  40  (interval  estimate);  and  feel  still  more  confident  that 
he  is  between  25  and  45;  and  absolutely  certain  that  his  age  will  be  em- 
braced between  20  and  50.”  These  intervals  enjoy  varying  degrees  of 
confidence,  and  for  that  reason  are  more  realistically  known  as  confidence 
intervals*  The  larger  the  interval,  the  more  confidence  we  have  that  it 
includes  the  true  age.  The  age  of  the  gentleman  is,  of  course,  fixed.  It  is 
what  it  is,  even  though  we  arc  not  completely  informed.  But  as  we  shift 
the  size  of  the  interval,  contracting  and  expanding  it  around  the  point 
estimate,  we  revise  the  degree  of  our  confidence  that  the  interval  will 
enclose  his  true  age.  To  be  sure,  the  initial  estimate  of  35  years  may  have 
been  brilliantly  correct,  but  we  shall  never  have  the  satisfaction  of  know- 
ing ‘t  unless  we  ask  him  directly  — which  under  our  assumptions  we  cannot 
do.  So  it  is  with  sampling. 

Wc  already  know  how  to  draw  a random  sample;  all  we  need  to  learn 
in  addition  is  how  to  construct  the  aforementioned  confidence  interval, 
the  first  step  of  which  is  to  establish  the  value  of  its  unit  of  measure.  This 
is  the  average  by  which  all  samples  deviate  from  the  unknown  parameter, 
and  is  called  the  standard  error. 

27 tc  Logic  of  the  Standard  Error  of  the  Mean.  Our  demonstration  has  re- 
vealed that  the  theoretical  sampling  distribution  of  means  may  be  viewed 
as  a normal  curve  (Figure  12.1.1),  with  the  true  mean  at  the  center,  as  the 
grand  mean  of  means.  Statisticians  express  this  truth  more  formally: 
“The  sampling  distribution  of  the  mean  is  approximately  normal  around 
the  true  mean,  provided  n > 30.”  Now,  wc  do  not  know'  whether  the 
mean  of  the  sample  which  we  happened  to  draw  lies  near  the  true  mean,  a 
sigma  removed  from  it,  or  at  any  other  point  on  the  base  line;  perhaps  it 
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is  even  three  sigmas  removed.  Hut,  i ’ v.'e  had  the  value  of  the  standard 
deviation  of  the  sampling  distribution,  rro  could  set  up  an  interval  estimate 
for  the  true  mean;  hence,  we  must  obtain  an  estimate  of  this  value.  We 
label  this  value,  not  a standard  deviation  ( SD),  but  more  descriptively  a 
standard  error  ( SE)t  since  the  deviations  of  all  sample  means  from  the  true 
mean  arc  actually  sampling  errors. 

Our  sample  alone  must  furnish  the  information  on  which  to  base  an 
estimate  of  the  standard  error,  since  it  is  all  wo  have.  Hcncc,  we  must 
explore  the  sample  to  uncover  some  cues  to  the  magnitude  of  the  sought- 
for  standard  error.  Initially,  we  might  guess  that  the  size  of  the  sample 
(n)  and  the  degree  of  scatter  in  the  sample  (SD)  arc  related  to  the  size  of 
the  standard  error,  according  to  the  following  reasoning. 

First,  as  to  sample  size,  we  should  expect  larger  samples  to  be  move 
representative  of  the  universe  than  smaller  samples,  and  we  would  there- 
fore expect  larger  samples  to  yield  smaller  sampling  errors.  In  fact,  when 
a sample  contains  every  last  item  in  a finite  universe  (a  100  per  cent  sample), 
there  can  be  no  sampling  error  at  all.  Every  “sample”  moan  would  then 
necessarily  be  identical  with  the  universe  mean,  and  the  standard  error 
would  be  zero.  At  the  other  extreme,  the  smallest  possible  sample  would 
contain  only  one  ease  (n  « 1).  The  distribution  of  sample  means  would 
then  be  identical  with  that  of  individual  universe  values,  and  would  dis- 
play the  same  amount  of  variation  as  the  items  in  the  universe  itself. 
Hence,  in  that  instance,  the  standard  error  would  be  equal  to  the  standard 
deviation  of  the  universe  (cr).  From  this  brief  analysis,  we  may  conclude 
that  the  standard  error  of  the  mean  (a)  will  always  be  smaller  than  the 
standard  deviation  of  the  universe,  since  n will  always  be  larger  than  1; 
and  (b)  will  decrease  as  sample  size  (zri  increases,  reaching  zero  when 
n *=  100  per  cent  of  the  universe. 

Second,  we  intuitively  expeel  the  degree  of  scatter  in  the  universe,  as 
reflected  by  the  degree  of  scatter  in  the*  sample,  to  affect  the  size  of  the 
sampling  errors.  Thus,  if  all  values  in  rhe  universe  were  alike,  all  sample 
means  would  also  be  alike,  and  there  would  he  no  sampling  error  whatever. 
The  standard  error  of  sucli  a completely  homogeneous  universe  would 
always  be  zero!  A sample  of  one  boy  would  give  us  without  sampling 
error  the  average  number  of  arms  for  .all  boys,  since  every  boy  lias  the 
same  number  of  arms  — a perfectly  homogeneous  universe;  however,  a 
sample  of  one  boy  would  not  give  us  vri.hout  error  the  average  height  of 
all  boys.  There  is  a great  variety  of  heights,  which  produces  comparable 
variety  within  samples,  and  consequently  among  the  sample  means  as 
well.  Inverting  this  argument,  we  conjc  :ture  that  a high  degree  of  scatter 
in  the  sample  is  indicative  of  heterogeneity  in  the  universe  and  therefore 
of  large  sampling  errors.  We  thus  anticipate  that  the  standard  error  for 
a given  « will  be  larger  when  sampling  a very  heterogeneous  mass  of  data 
than  when  sampling  a relatively  homogeneous  mass. 
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Compviation  oj  the  Standard  £7;  or,  Both  of  the  foregoing  insights  are  in 
accord  with  statistic!  theory,  and  are  nenially  explicit  in  the  formula  for 
the  standard  error  of  the  mean,  which  contain?  n and  <r  as  its  baeic  terms. 
Without  further  proof,  we  give  the  formula: 

<rs  = -£=  (12.1.1) 

Vn 

where  <?x  * standard  error  of  the  mean 

cr  = standard  deviation  of  the  universe 
n * sample  size 

Of  course,  we  do  not  have  the  standard  deviation  of  the  universe,  as 
required  by  the  formula;  all  we  have  is  the  standard  deviation,  $,*  of  the 
sample.  We  will  therefore  substitute  it  for  the  unknown  parameter  in  the 
above  formula. 

s*  = 77=  (12.1.2) 


But  the  value  of  the  substituted  standard  deviation  of  the  sample  is 
not  identical  with  that  of  the  universe,  owing  to  sampling  error;  in  fact, 
the  former  tends  to  be  smaller  than  the  latter  because  the  scatter  of  values 
within  the  sample  tends  to  be  smaller  than  that  in  the  universe.  To  match 
this  understatement  in  the  numerator  (i.e.,  to  maintain  the  ratio  in  the 
fraction),  we  correspondingly  reduce  sample  size  (n)  by  one  in  the  de- 
nominator: 


(12.1.3) 


This  formula  may  fittingly  be  substituted  for  the  population  sigma.  The 
correction  becomes  trivial,  of  course,  if  sample  size  is  very  large.  Never- 
theless, it  is  conventionally  included,  as  a matter  of  principle,  even  in  large 
samples.  The  working  formula,  with  familiar  symbols,  will  therefore  read 
as  follows: 


JM. 

Vn- 1 

JZM1 

V n(n  - 1) 


(12.1.4) 


This  Expression,  then,  yields  the  standard  error  of  the  mean  which  we 
have  been  looking  for  and  from  which  we  shall  construct  the  interval 
estimate. 


* Greek  letters  are  conventionally  used  to  represent  parameter s (e.g.,  <r,  /i);  Latin 
letters  (c.g.,  *,  X)  represent  wt triple  statistics,  v.hich  are  necessarily  estimates.  But 
this  practice  is  not  uniformly  adhered  to  by  u rilers  La  the  field. 
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Construction  of  the  Confidence  Interval  ( Large  Sample).  Since  the  sampling 
distribution  of  the  mean  is  approximately  normal  around  the  population 
mean,  it  follows  that  two  out  of  three  sample  means  (G8.27  per  cent)  wall 
lie  within  one  standard  error  of  the  true  mean.  Our  particular  sample 
mean  will  therefore  have  a 2 in  3 chance  of  falling  within  one  standard 
error  of  the  true  mean.  Hence,  by  adding  and  subtracting  one  standard 
error  to  and  from  our  randomly  selected  sample  mean,  w*c  have  an  approxi- 
mately 2 in  3 chance  of  enclosing  the  true  mean.  Similarly,  95  out  of  100 
sample  means  lie  within  1 .90  standard  errors  of  the  true  mean;  in  conse- 
quence, if  we  attach  1.9G  standard  errors  to  either  side  of  the  sample  mean, 
we  have  a 95  per  cent  probability  of  enclosing  the  true  mean.  In  general, 
it  is  possible  to  provide  any  desired  confidence  interval  by  the  simple 
technique  of  attaching  the  requisite  multiple  of  the  standard  error  to  the 
observed  sample  mean.  By  thus  making  our  interval  estimate  wide 
enough,  we  may  be  practically  certain  that  the  true  mean  has  been  con- 
tained within  the  interval,  even  though  the  observed  sample  mean  is 
highly  inaccurate. 

Let  us  suppose  that  w*e  have  drawn  a “bad”  sample,  whose  mean,  X0, 
is  located  in  the  tail  of  the  sampling  distribution,  far  removed  from  the 
universe  mean  (ji ) which  it  intends  to  represent  (Figure  12.1.2).  Clearly, 


Figohe  12.1.2  Theoretical  Sampling  Distribu- 
tion of  Meant  Observed  Sample  Mean} 

in  this  instance,  it  would  not  have  been  sufficient  to  attach  .only  one 
standard  error  to  cither  side  of  the  observed  mean  in  order  to  .catch  the 
true  mean;  nor  would  it  have  been  enough  even  to  attach  two  standard 
errors,  since  the  true  mean  is  more  than  two  standard  errors  distant  from 
our  illustrative  sample  mean.  However,  if  w*c  affix  three  standard  errors 
to  either  side  of  the  observed  mean,  we  obtain  an  interval  whose  lower 
limit  extends  substantially  beyond  the  true  mean.  Since  practically  all 
(99.74  per  cent)  of  the  sample  means  lie  within  three  standard  errors  of 
the  universe  mean,  by  adding  and  subtracting  throe  standard  errors  to 
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and  from  any  randomly  selected  mean  we  become  virtually  certain  that 
the  resultant  interval  will  trap,  as  it  were,  the  target  mean.  Even  the  most 
pessimistic  soul,  who  always  tears  the  worst,  may  give  himself  a sense 
of  relative  security  by  attaching  three  standard  errors  to  his  observed 
sample  mean. 

This  method,  be  it  noted,  does  not  disclose  the  exact  whereabouts  of 
the  true  mean,  as  has  bec-n  previously  cautioned;  it  merely  furnishes  a 
stronger  or  weaker  expectation  that  the  true  mean  will  be  found  within 
the  specified  interval  estimate,  derived  from  the  observed  sample.  The 
true  mean  is  stationary,  wherever  it  is.  Since  it  is  the  interval  cst'matc 
that  either  succeeds  or  fails  in  enclosing  the  parameter  on  a given  sampling 
trial,  we  may  properly  speak  of  the  probability  of  such  an  interval  succeed- 
ing in  encompassing  the  universe  mean.  Of  course,  onto  the  sample  has 
been  drawn  and  the  interval  formed  — once  the  trial  is  over  — it  either 
does  or  does  not  include  the  target  mean  between  its  calculated  limits, 
although  we  will  never  know.  Our  confidence  rating  in  ilrat  interval 
corresponds  to  its  pre-trial  probability,  which  lias  beeu  selected  for  our 
convenience  and  purpose. 


Working  Procedure.  To  illustrate  the  process  of  calculating  a confidence 
interval,  we  carry  out  all  necessary  operations  on  the  following  sample 
of  30  suicide  rates: 


6.0 

1S.6 

4.6  * 

11.2 

18.8 

24.9. 

9.1 

14.7 

11.6 

22.4 

13.4 

11.5 

S.2 

2S.S 

10.0 

5.4 

7.5 

26.1 

8.6 

8.1 

23.4 

11.6 

14.S 

13.6 

6.0 

8.4 

10.0 

10.4 

9.6 

15.3 

(1)  We  compute  the  sample  mean: 

y _ 39245 
A“  30 
= 13.1 

(2)  We  compute  the  standard  error  of  the  sample  mean: 

„ _ i/X?iDL 

* V (30)  (29) 

« 1.2 

(3)  We  multiply  sx  by  the  number  of  sigma  units  to  be  added  and 
subtracted  for  the  agreed-upon  confidence  interval  — for  example,  1.90 
for  the  95  per  cent  interval: 

1.90  ss  = 1-90(1.2) 

» 2.4 


i ■ 
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(4)  We  add  and  subtract  l.PO;?  to  and  from  the  observed  sample 
mean,  X.  Thus: 

X ± 1.90s* 

13.1  + 2.3  « 15.1 
13.1  - 2.3  « 10.S 

(5)  Wc  finally  make  the  Interpretation  that  the  chanccvs  are  93  in  100 
that  the  true  mean  will  be  found  within  the  interval  10. $-15. 4.  We  are 
reasonably  sure  tint  our  interval  c<v  a i n s the  true  mean,  since  only  5 
intervals  in  100  constructed  in  the  an:e  manner  v.  ill  fail  to  do  so.  If, 
however,  it  doesn't,  something  has  happened  which  would  occur  only  5 
times  in  100,  a rbk  which  wc  may  bo  willing  to  face. 

But  how*  decide  on  th  ' width  of  the  confidence  interval?  Hero  one  can 
offer  only  the  most  guidance.  Into  the  choice  will  enter  a considera- 

tion of  the  consequences  of  the  clecisioi , the  risk  one  will  wish  to  run,  and 
even  the  temperament  of  the  person  concerned.  In  effect,  a decision  like 
this  is  computable  to  any  other  of  the  numerous  decisions  we  make  in  tho 
face  of  life's  uncertainties. 

Demonstration  of  Experimental  ConfJi'ice  /ntuiQh.  All  statements  of 
probability,  it  will  be  recalled,  are  based  on  the  assumptio.  of  an  infinite 
number  of  trials.  While  it  is  not  possible  to  conduct  such  an  infinity  of 
trials,  it  is  usually  possible  to  conduct  a fairly  largo  number  of  experimental 
trials,  the  outcomes  of  which  then  serve  to  test  the  initial  probability 
statement.  To  that  end,  wc  have  calculated  tho  93  par  cent  confidence 
limits  for  each  of  our  100  samples  (n  * 30)  of  suicide  rates,  in  order  to 
determine  whether  the  resulting  intervals  would  actually  enclose  the 
true  mean  approximately  95  per  cent  of  the  times.  The  diagram  on  the 
opposite  page  (Figure  12.1.3)  portrays  the  locations  of  the  100  confidence 
intervals,  along  with  the  known  population  mean  of  12.4.  I*  will  be  ob* 
served  that  exactly  95  out  of  100  interval  estimates  do  enclose  the  true 
mean,  a result  which  is,  surprisingly  enough,  identical  with  theoretical 
expectation.  Here  is  tangible  evidence  that  the  confidence  we  place  in  a 
properly  constructed  interval  estimate  is  amply  justified. 

Convenience  of  Large  SampUe.  Up  to  now,  all  probability  statements 
have  been  based  on  the  assumption  that  the  sampling  distribution  of  the 
mean  is  normal.  It  is  reassuring  that  such  a sampling  distribution  will 
always  be  normal  when  the  universe  itself  is  normal.  A normal  popula- 
tion products  normality  in  this  sampling  distribution,  whatever  the  sire 
of  the  sample. 

However,  it  is  particularly  characteristic  of  sociological  data  that  popu- 
lations are  often  not  normal.  Sire  of  families  income  and  other  social 
variables  distinguish  tnemselvcs  iVcm  typical  psychological  du'a  in  that 
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the/  are  often  severely  skewed.  We  may  therefore  raise  the  question  of 
whether  such  skewness  disturbs  the  required  normality  of  the  sampling 
distribution.  The  answer  is:  it  does.  Hut  at  the  same  time,  the  effect 
of  such  skewness  can  be  circumvented  hy  npprcpmtely  increasing  the 
sire  of  the  sample  beyond  the  approximate  minimum  of  30.  As  sample 
size  increases,  the  sampling  distribution  approaches  normality.  However, 
no  general  st  atement  can  be  made  about  the  rate  at  which  th?  sampling 
distribution  approaches  normality,  since  thi*  rate  will  vary  according  to 
the  severity  of  the  skew  to  be  offset.  Hut  we  may  gain  some  iusight  into 
what  might  on  its  face  seem  a rather  remarkable  phenomenon  by  examining 
the  following  experiment,  in  whl.h  we  drew  samples  of  size  20  and  samples 
of  size  40  from  a compilation  of  10 0 large  American  cities  for  size  of  popu- 
lation. Tire  original  distribution  ^Table  12.1.3)  was  highly  skewed  and 


Table  12,1.3 

Population  Ditlribulion, 
106  American  Citits 
100,000  one/  Over,  6y 
Sin  of  Population 


Size  or  Cim.i 
(’000) 

/ 

Per  Cent 

100-10) 

56 

52.9% 

200-00) 

14 

13.2 

300-309 

11 

10.4 

400 -4  CO 

«» 

< 

6.6 

500-500 

6 

4.7 

600-000 

2 

1.9 

*00-790 

l 

.9 

SOO-SOO 

3 

8.8 

000-900 

2 

1.9 

1,000  or  more 

5 

4.7 

106 

100.0% 

Soorc*:  V S.  of  tv.%  Ctmu#,  V.$.  CVstvi  a? 

• /?;*>.  O.a-tf.-ff/r  W-i  ft/  iW  PppA**i*%,  Vot. 
II,  Par I 1,  V S.  V S.  Oov*!T\m*r»t  Printing 

OGe*.  TVisVf  gioo.  D C,  1953,  Table  lb. 


consequently  ideally  suited  for  experimentally  testing  the  effect  of  a 
tkeaed  population  on  the  sampling  distribution  of  the  mean. 

The  first  set  of  200  samples  (n  *=  20)  was  drawn  and  the  mean  of  each 
sample  calculated  and  tabulated  (Table  12.1.4a).  This  experimental 
sampling  distribution  ri  still  perceptibly  skewed  to  the  right,  but  not 
nearly  to  the  same  degree  as  was  the  parent  universe.  ThU  reduction 
in  skewness  iva  re  suit  of  the  fundamental  principk  that  the  means  are 
great  Icvelers  which  necessarily  cut  dw.i  the  individual  extreme? 
found  in  the  universe.  It  follows  then h*ro  from  the  very  nature  of  the 
mean  that  a distribution  of  sample  mean*  can  never  bo  as  irregular,  or 
M widely  dispersed,  as  arc  the  individual  values  which  make  up  the 
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Figure  12.1.4  Histogram,  /OJ  Jncncon  Cilia  hy  Sue  of  Population 


Table  12.1.4a 

Distribution  of  200 
Sample  Meant,  n « 20 


Mean  (’000) 

/ 

Per  Cent 

100-199 

2 

1.0% 

200-279 

SO 

25.0 
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33.5 

4<x>-iro 

31 

15.5 
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9.0 

609  099 

17 

8.5 

“00-799 

8 

4.0 

500-599 

4 

3.0 

900-909 

1 
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1,000  of  more 
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samples.  Furthermore,  the  larger  the  sample,  the  more  successfully  will 
tbe  dispersion  be  reduced.  Thus,  when  sampling  from  a markedly 
skewed  population,  samples  even  os  small  as  20  will  produce  n sampling 
distribution  displaying  evidence  of  a strain  toward  normality,  although 
obviously  falling  far  short  of  that  goal. 

Samples  of  40  should  therefore  continue  the  trend  toward  normality, 
or  at  least  toward  symmetry.  Such  a distribution  is  tabulated  in  Table 
12.1.4b.  Gone  now  is  the  long  tail;  instead,  the  beginnings  of  symmetry 


ToNe  12.1.4b 

Diilribulion  oj  100 
Somplt  Mtam,  n « 40 


Mean  (000) 

/ 

Pea  Cent 

100-100 

0 

0.0% 

200-200 

10 

10.0 

300-305 

41 

41.0 

400-405 

21 

21.0 

300-505 

17 

17.0 

600-C05 

3 

3.0 

700-795 

7 

7.0 

800 -SO  5 

1 

1.0 

900-00) 

0 

0.0 

1,000  or  more 

0 

0.0 

100 

100.0% 

are  clearly  visible.  Thus,  sociologists  who  are  often  called  upon  tc  deal 
with  skewed  populations  may  draw  considerable  comfort  from  the  im- 
portant implications  of  this  little  experiment  and  may  feci  Justified  in 
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applying  normal  probability  calculations  to  their  data  by  taking  proper 
precautions. 

The  convenience  of  large  samples  consists  in  the  assurance  that  the 
sampling  distribution  of  the  moan  is  approximately  normal,  an  assump- 
tion which  may  bo  in  doubt  when  sample?  are  small.  Situations  do, 
however,  often  arise  when  small  samples  arc  not  only  convenient,  but 
even  unavoidable.  In  such  instances,  modification  of  the  foregoing  pro- 
cedures is  required,  although  no  r.cw  principle?  are  involved.  This 
adaptation  of  reliability  techniques  to  small  samples  is  therefore  not  con- 
sidered in  this  text. 

Confidence  I r,h  rial  for  a Percent':  gt  (Lorgc  Somf  U).  hike  the  mean,  the 
sample  percentage  also  raises  the  issue  of  reliability,  since  sample  per- 
centages are  equally  subject  to  sampling  errors.  Thus,  a given  sample 
percentage  (e  g.,  60  per  cent  favoring  military  training)  may  be  off  a 
trifle,  or  may  deviate  considerably  from  the  true  percentage.  Similarly, 
30  per  cent  of  a random  selection  cf  married  women  may  be  gainfully 
employed,  but  the  percentage  in  the  sampled  universe  may  conceivably 
be  os  low  as  20  or  as  high  as  SO.  Fifty-one  per  cent  of  the  sample  elec- 
torate may  signify  its  intention  of  voting  for  the  Republican  nominee 
in  the  forthcoming  election,  but  unless  we  can  in  some  way  assess  the 
accuracy  of  that  estimate  we  cannot  forecast  with  any  degree  of  confi- 
dence the  outcome  of  the  election.  Hence,  in  interpreting  a sample  per- 
centage, we  must  again  estimate  the  average  sampling  error  (standard 
error)  in  order  to  provide  a confidence  interval. 
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The  technique  of  constructing  an  interval  climate  of  a universe  per- 
centage is  in  its  essentials  no  different  from  that  of  the  mean:  we 
(1)  fix  the  desired  degree  of  confidence;  (2)  estimate  the-  standard  error; 
and  (3)  attach  to  tbo  observed  sample  percentage  the  multiple  of  the 
standard  error  needed  to  attain  the  agreed-upon  confidence.  The  sam- 
pling theory  on  which  this  procedure  rests  is  identical  with  that  under- 
lying the  treatment  of  large  sample  means:  namely,  the  distribution  of 
all  possible  percentages  ( p ) is  approximately  norma!  around  tho  universs 
percentage,  P,  with  standard  error 


cf  «= 


(12.1.5) 


where  o,  *=  standard  error  of  percentage 
P « universe  percentage 

0 - ioo  - r 

But  since  the  parameters  P and  Q arc  of  course  unknown,  we  ag-un  follow 
the  convention  of  substituting  the  sample  values  p and  5 in  order  to 
obtain  an  estimate  of  the  standard  error:  * 


<»•= 


(12.1.6) 


Computing  an  Infer  col  Estimate  cf  a Percentage.  Let  us  suppose  that  Out 
0/  600  randomly  selected  high  school  students,  C9  per  ceut  respond  in 
favor  of  universal  military  training  and  -10  per  cent  are  unfavorable. 
To  form  a confidence  interval,  we  must  first  estimate  the  standard  error: 


* VB$7 
» 1.63 


By  adding  and  subtracting  this  quantity  to  end  from  the  sample  per- 
centage, we  obtain  the  6S  per  cent  confidence  interval: 

60%  ± 1.03  percentage  p>int3 
58.37%  - 61.03% 

The  odds  ore  2 to  l that  the  true  percentage  lies  within  this  interval. 
If  we  should  desire  greater  confidence  — • say,  95  per  cent  — we  would 
attach  1.60  standard  errors: 

60  ± 1.60(1.63) 

60  ± 3.19 
56.81%  - 63.10% 

* A*  inJbtMse  of  lb*  rnsan,  » betwr  of  tb»  error  of  p b provid'd 

by  jffj-  For  a «K*c\j*rion  of  this  yr.V.,  see  tVffihn  O.  Cocbtin,  So-np’jn;  7t:\- 
nifett,  Joba  Wi>>  & Sons,  Inc,  New  York,  1933,  pp.  ??-3 J. 
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Since  intervals  constructed  in  this  manner  contain  the  true  percentage 
95  times  out  of  100,  we  may  declare  that  we  arc  confident  at  the  95  per 
cent  level  that  the  true  percentage  is  not  less  than  56. SI,  nor  greater 
than  63.19.  Since  5 per  cent  may  be  assumed  to  exclude  the  true  per- 
centage, wc  thus  take  a 5 per  cent  risk  cf  being  wrong. 

The  foregoing  procedure  is  another  instance  of  krgc-sampic  technique 
la  that  it  applies  to  samples  having  30  or  more  cases.  But  the  assump- 
tion of  a normal  sampling  distribution  hinges  on  the  ratio,  P:Qt  as  well 
as  on  sample  size.  As  the  P:Q  balance  depaits  more  and  more  from 
50:50,  the  sampling  distribution  becomes  increasingly  skewed. 

As  has  already  been  pointed  out  in  the  study  of  the  mean,  in  the  case 
of  percentages,  the  skew  in  the  P:Q  balance  communicates  itself  to  the 
sampling  distribution.  Again,  as  with  the  mean,  this  skew  can  be  cir- 
cumvented only  by  a sharp  increase  in  the  size  of  n,  so  as  to  instill  nor- 
mality in  the  sampling  distribution.  Table  12.1.5  shows  the  sample  sires 


Table  12.1.5 

Mfflfnv/H  ii  for  Selected  p-Vahu‘# 
for  Aform<if  •Approximation 


needed  for  varying  sample  percentages  in  order  to  permit  the  assump- 
tion of  a normal  sampling  distribution. 

The  Finite  Population  Multiplier.  In  actual  eockJ'.'gRal  investigations, 
we  ordinary  encounter  finite  populations,  and  yet  in  the  foregoing  in- 
stances we  have  calculated  the  standard  error  on  the  assumption  cf  sam- 
pling from  an  infinite  universe.  But  this  is  not  quite  so  unrealistic  as 
might  appear  on  its  face.  Finite  populations  arc  usually  hrgo  enough  to  bo 
considered  os  infinite,  which  is  incidentally  something  of  a practical  con- 
venience, since  reliability  procedures  arc  somewhat  ks  complex  for  infi- 
nite than  for  finite  populations.  For  finite  populations,  the  meosu remen t 
of  reliability  must  take  into  account  both  sample  sire  (n)  and  the  size  of 
the  universe  (AT),  whereas  for  infinite  populations,  the  computation 
of  reliability  need  take  into  tccourd  Orly  sample  she,  since  the  sire  of 
the  universe  is  incalculable  nnd  hence  cannot  vary. 
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Section  Two 

Testing  d Statistical  Hypothesis 

Hypothesis-Testing  and  Estimation  Compared.  There  are  two  general 
types  of  statistical  inference:  (1)  estimation,  which  begins  without  any 
stated  assumption  about  the  value  of  the  parameter  and  merely  seeks 
to  estimate  descriptively  what  the  parameter  could  bo;  and  (2)  hypothesis- 
testing,  which  begins  with  a hypothesis  about  the  parameter  and  then  uses 
the  sample  data  to  check  the  plausibility  of  that  statement. 

In  the  previous  section,  we  were  concerned  with  problems  of  estima- 
tion. Wc  began,  for  example,  with  the  observation  of  a sample  mean, 
and  from  this  wc  derived  an  interval  estimate  with  a specified  degree 
of  confidence.  We  first  drew  our  sample  and  then  made  our  estimate 
of  the  parameter. 

But  in  hypothesis-testing,  ve  formulate  our  hypothesis  about  a param- 
eter in  advance  of  the  collection  of  the  sample  data,  which  is  then  used 
to  test  that  hypothesis.  We  may,  for  example,  hypothesize  that  the  aver- 
age suicide  rate  of  eastern  cities  does  not  disTor  from  that  of  western  cities. 
We  befc'  with  that  supposition,  and  then  we  take  an  appropriate  sample 
of  eastern  and  western  cities,  compare  the  two  means  In  the  prescribed 
manner,  and  finally  reach  a decision  whether,  in  the  light  of  the  sample 
difference,  the  hypothesis  should  be  accepted  or  rejected. 

Although  an  interval  estimate  is  always  derived  from  the  previously 
compiled  sample  data,  a statistical  hypothesis  involves  quite  another 
analytical  process.  This  process  starts  with  an  antecedent  conjecture 
about  an  unknown  population  value,  presumably  arrived  at  without 
benefit  of  the  undrawn  sample.  Furthermore,  a hypothesis  is  tested  and 
then  acted  upon:  it  is  either  accepted  or  rejected,  it  requires  a decision, 
, which  is  made  with  a certain  degree  of  confidence,  and  which  is  cither 
correct  or  incorrect  after  it  has  been  made.  The  emphasis  thus  shifts 
from  mere  estimation  to  deeision-nahing. 

Tht  A full  Hypothesis  (//*)-  It  has  become  a convention  in  statistical 
testing  to  open  the  investigation  with  the  null  hypothesis,  svmbolitod 
Ht.  In  its  most  current  usage,  this  hypothesis  holds  that  two  or  more 
given  samples  have  come  from  statistically  identical  populations  and  that 
therefore  any  observed  difference  between  such  samples  is  merely  a 
chance  variation.  The  aforesaid  hypothesis  that  East  end  West  do  not 
differ  in  their  average  suicide  rates  would  therefore  be  a typical  null 
hypothesis. 

Essentially,  the  null  hypothesis  is  set  up  to  be  nullified;  however, 
every  other  type  of  hypothesis  i-  also  set  up  for  that  possibility.  How, 
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then,  does  the  null  hypothesis  differ  from  any  oilier  hypothesis  — from 
a hypothesis,  for  example,  stating  that  juvenile  delinquency  is  associated 
with  status  discontent,  or  that  the  incidence  of  suicide  depends  on  the 
degree  of  . ini  diorpnization?  These  hypotheses  we  nay  call  ex* 
plnnntory  h\ , , hs,  \ c.-  as  the  null  hypothesis  is  an  auxiliary  device 
which  tentative/  ilu*/'  ‘ validity  of  the  explanatory  theory.  Hence, 
d wc  reject  the  null  hv  c>is,  we  strengthen  the  credibility  of  our 
t planntory  hypothesis,  but  the  null  hypothesis  itself  has  no  explanatory 
value. 

Yet  science  is  in  search  of  relationships.  Hence,  in  accordance  with 
the  above  reasoning,  the  null  hypothecs  i?  usually  launched  with  the 
expectation,  indeed  with  (he  hope,  that  il  will  )>c  nullified,  as  the  dcri* 
vation  of  the  term  implies.  Nevertheless,  i?  represents  a possibility 
that  must  be  disposed  of  before  alternative  hypotheses  which  imply 
assignable  causes  can  be  considered. 

Returning  to  the  comparison  of  eastern  and  western  cities,  we  would 
set  up  such  a comparison  in  the  first  place  only  in  order  to  uncover  some 
of  the  factors  that  determine  the  incidence  of  suic‘de.  If  tve  do  discover 
a significant  difference  between  the  average  rates  of  East  and  West  and 
consequently  re’e  t tbc  mill  hypothesis,  we  will  have  added  a small 
increment  to  our  knowledge  of  suicide  — namely,  the  regional  factor  ns 
a source  of  variation  in  the  suicide  rate.  If,  on  the  other  hand,  the  two 
samples  of  data  show  no  significant  difference  and  we  accept  the  null 
hypothesis,  then  we  have  not  advanced  our  understanding,  although 
supposedly  we  will  not  have  retrogress'd.  In  this  latter  case,  we  w*ould 
have  to  contiive  new  kinds  cf  comparison*  in  our  search  for  the  assign* 
able  causes  of  suicide. 

Thus,  it  is  only  when  the  preliminary  hypothesis  of  "no  difference" 
has  been  cleared  away  that  wc  gain  some  insight  into  the  occurrence  of 
suicide.  Hence,  the  null  hypothesis  is  inherently  linked  with  a more 
constructive  statistical  hypothesis,  sometimes  calkd  an  alternative  hy* 
pothesis,  which  becomes  tenable  to  the  extent  that  the  null  hypothesis  has 
been  discredited.  Such  an  alternative  hypothesis  may  specify  an  exact 
degree  of  diffcirncc  between  East  and  Uc?t,  with  a view  to  gauging  the 
strength  of  the  unknown  variables  which  produce  suicide  in  a disorganized 
society.  Clearly,  tlwrc  is  no  point  in  pursuing  the  effects  of  a given  control 
or  experimental  variable  such  as  geographical  region  jf  we  have  found  in 
favor  of  the  null  hypothesis.  Thus,  by  analogy /the  r.ull  hypotliesis  serves 
the  purpose  of  a criminal  trial:  wc  set  up  the  hypothesis  of  innocence, 
giving  the  evidence  an  opportunity  to  nullify  it.  Only  if  and  when  that 
presumption  is  rejected  dees  the  eomt  give  thought  to  alternative  punish- 
ments comspondmg  to  the  degree  of  guilt. 

In  accordance  with  the  foregoing  principle,  the  null  hypothesis  has 
corr.e  to  be  predominantly  identifis-l  with  two  types  of  research  pro* 
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ccdurcs:  (1)  the  comparison  of  two  or  more  populations  on  a given  trait 
and  (2)  the  correlation  between  two  or  more  traits  in  a given  population. 
In  the  first  type,  the  null  hypothecs  posits  no  difference  between  popu- 
lation parameters;  in  the  second  type,  it  asserts  a chance  relation,  or 
lero  correlation,  between  the  variables  under  study. 

Principles  of  Testing  the  A TvH  Hypothesis.  Let  us  now  suppose  a random 
sample  of  30  eastern  cities  and  a comparable  sample  of  30  western  cities 
whoso  mean  suicide  rates  arc  10. 1 and  11.3,  respectively  (Table  12.2.1). 
Wo  wish  to  know  whether  the  observed  difference  of  3.9  could  be  indic- 


Table  12.2.1 

SufciVc  Rates,  Pastern  a 
Western  Cities,  n = SO 
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ative  of  an  actual  difference  between  tlrn  morns  of  the  sampled  popu- 
lations, or  whether  it  could  more  plausibly  be  accepted  ns  mere  sampling 
variation.  By  rule  of  thumb,  we  lay  >»o\vn  the  mill  hypothesis  that  the 
population  means  of  East  caid  West  are  identical  and  sub;^*t  this  hypoth- 
esis to  the  statistical  tc-t*  If  we  reject  this  hypothesis,  we  then  proceed 
as  though  there  were  a real  difference,  wSnreos  acceptance  presumably 
implies  that  the  two  populations  ere  alike  and  that  the  observed  differ- 
ence is  a mere  sampling  error. 

An  important  principle  of  our  teeing  procedure  is  that  m can  ncm 
prow  the  null  hypothesis  true.  The  bc?t  possible  evidence  that  we  could 
ever  obtain  for  the  identity  of  the  two  population  means  would  be  an 
identity  between  the  two  sample  means.  But  even  if  samples  of  East 
and  West  were  to  show  identical  averages,  we  still  could  not  be  positive 
that  the  null  hypothesis  was  true.  Such  an  identity  between  sample 
menus  could  itself  very  well  be  the  result  of  sampling  errors,  because 
population  means  could  differ  by  a sizable  amount  and  the  sample  differ- 
ence still  be  zero.  Similarly,  the  finding  that  a sample  of  married  spouses 
ore  of  the  same  average  age  would  not  prove  that  all  married  couples 
show  equal  average*.  From  sample  observations,  the  conclusions  we  draw 
about  the  truth  of  the  null  hypothesis  me  necessarily  of  a probabilistic 
nature. 

If  we  can  never  prove  the  null  hypothesis  true,  may  we  prove  it  defi- 
nitely false?  Here  the  statistical  evidence  may  be  more  convincing. 
The  best  possible  evidence  that  the  two  populations  differ  would  be  a 
difference  between  samples.  A small  difference  would  not  be  very  com- 
pelling; however,  as  this  difference  becomes  larger  and  larger,  the  case 
for  a population  difference  become-  stronger  and  stronger.  If  the  differ- 
ence between  samples  become*  so  large  that  it  is  highly  improbable 
that  they  stem  from  the  same  universe,  then  we  may  with  practical 
assurance  reject  the  null  hypothesis.  Even  here,  however,  there  is  no 
infallible  certainty,  since  the  one  case  in  a million  — the  freak  event  — 
may  happen.  Highly  improbable  events  are  regularly  occurring,  how- 
ever startled  we  arc  when  they  hxfnll  us.  But  mkIi  an  extreme  case  — 
the  one  in  a million  — is  so  improbable  that  most  cf  us  would  discount 
it  by  regarding  it  as  impossible,  thereby  rejecting  the  null  hypothesis 
with  gieat  confidence.  In  other  wort!?,  wo  are  willing  to  reject  the  null 
hypothesis  when  the  statistical  probabilities  of  being  wrong  are  small 
enough  to  suit  our  pm  poses. 

These  decisions  — whether  or  not  to  reject  the  null  hypothesis  — arc 
accordingly  made  with  varying  digress  of  confidence.  This  confidence 
varies  according  to  the  probability  that  a difference  at  least  ns  large  as 
that  obtxivcJ  could  have  been  obtained  by  chance,  assuming  the  truth 
of  the  null  hypothesis.  We  must  thetc-fvic  now  turn  to  the  procedure 
by  which  such  a probability  is  determined. 
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The  previous  two  readings  provide  law  students  with  an 
introduction  to  statistical  theory.  The  usefulness  of  coming  to 
grips  with  such  material  is  linked  to  two  considerations: 

(1)  there  is  a fundamental  likeness  between  legal  reasoning  and 
statistical  theory,  and  (2)  law  practice  in  the  future  will  contain 
numerous  demands  that  the  practitioner  be  familiar  with 
statistical  procedures. 

In  discussing  the  similarity  between  legal  reasoning  and 
statistical  theory  Michael  Finklestein  argues  that  statistical 
decision  theory  is  relevant  to  law  because  legal  principles  are 
the  intuitive  equivalents  of  ideas  used  in  probability  theory. 
While  his  illustration  for  this  statement  is  based  specifically 
on  racial  discrimination  in  jury  selections,  we  feel  that  it  has 
significance  beyond  his  immediate  example.  "Probability 
theory, " he  notes,  "has  been  found  to  apply  to  events  commonly 
called  random  or  chance."  That  is,  the  events  are  referred  to 
as  random  because  for-  uil  practical  purposes  there  is  no  way  of 
predetermining  their  outcome  which  is  another  way  of  saying 
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that  events  have  an  equal  chance  of  occurring.  This,  he  concludes, 
is  in  theory  similar  to  the  constitutional  requirement  that  jurors 
be  chosen  without  regard  to  their  race.  This  is  not  to  say  that 
juries  must  be  entirely  representative  of  the  population  with 
respect  to  the  number  of  Negroes  and  whites  selected.  The 
use  of  literacy  tests  or  lining  telephone  listings  may  prevent 
Negroes  from  serving  on  juries;  and,  presumably,  such  devices 
do  not  violate  the  equal  protection  clause  of  the  Constitution.  But 
within  the  pool  of  eligibles,  however  restricted  in  size,  the 
selection  must  be  made  without  reference  to  race;  and,  as 
Finklestein  points  out,  this  means  that  the  racial  composition 
of  jury  venires  is — or  should  be — the  result  of  random  selection 
and  consequently  an  appropriate  subject  for  probability  theory 
analysis. 
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pp.  338-376.  Reprinted  by  permission. 
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Judges  have  often  relied  on  intuitive  notions  of  probability  in  de- 
ciding whether  there  has  been  discrimination  in  selection  of  juries, 
ifr.  Finkelslein  uses  mathematical  analysis  to  show  the  extreme 
improbability  that  nondisc  rim: no tory  selection  occurred  in  situations 
in  which  courts  have  been  unwilling  to  find  a isolation  of  the  four- 
teenth amendment.  His  approach  is  not  intended  to  supplant  judicial 
weighing  of  values;  on  the  cor.tr ary,  it  exposes  important  legal  issues 
previously  hidden  by  evidentiary  dif.culties. 

HIS  article  will  explore  the  applicability  of  the  branch  of 


mathematics  known  as  statistical  decision  theory  to  cases 
involving  claims  of  discrimination  in  the  selection  of  jurors.  It 
is  my  purpose  to  show  that  in  these  cases  the  Supreme  Court  has 
established  legal  principles  which  in  many  instances  cannot  ra- 
tionally be  applied  without  such  mathematical  techniques. 

Over  the  past  two  hundred  years  the  romance  between  mathe- 
matics and  law  has  been  for  the  most  part  a one-sided  affair. 
The  prospect  of  making  their  discipline  effective  in  an  area  of 
social  concern  has  persistently  teased  mathematicians,  but  law- 
yers, judges,  and  administrators  have  been  slow  to  accept  their 
advances.  Elementary  mathematical  techniques  occasionally  have 
been  used  in  the  law  where  they  obviously  facilitated  the  achieve- 
ment of  legal  objectives.  Every  judge  deciding  an  antitrust  merger 
case,  for  instance,  has  used  simple  mathematics — percentages  — 
to  illuminate  share-of-market  statistics.  The  reason  is  that  the 
meaning  of  percentages  is  well  understood  an.d  size  comparisons 
are  almost  impossible  without  them.  On  a more  sophisticated 
level,  the  apportionment  of  congressional  representatives  is  now 
controlled  by  a mathematical  technique  known  as  the  “method 
of  equal  proportions,”  a technique  adopted  because  mathemati- 
cians and  legal  scholars  were  able  to  persuade  Congress  that  the 
objectives  of  fair  apportionment  cot, Id  best  be  accomplished  in 

♦Member  of  the  'nV.v  Yur!<  Bir.  A. 3.,  Harvard,  iq 35,  LL.J.,  1953. 

I am  Indebted  to  ProtCijorj  Dime!  Greenberg  md  Richard  Fruxlhcrg,  Colum- 
bia University,  and  to  my  tsife  Elinor,  for  lieL-  reading  and  correction  of  the 
manuscript. 
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this  way.1  There  is  also  an  extensive  literature  on  the  subject 
of  mathematical  prediction  of  legal  decisions  by  correlation  analy- 
sis and  other  techniques.2 * 

It  is  a reflection  of  the  empirical  orientation  of  probability 
theory  that  this  branch  of  mathematics  was  the  first  to  be  applied 
to  a legal  problem.  In  1785,  the  French  mathematician  Marie 
Jean  Antoine  Nicolas  Caritat  Condorcet  published  his  Essai  sur 
V application  de  ^analyse  c la  probability  tics  decisions  rendues  d 
la  plurality  des  voix.  Condorcet’s  Essai  explored  the  probability 
of  error  in  a decision  made  by  a group  of  persons  — the  principal 
application  of  the  theory  being  jury  decisions.  This  subject 
captured  the  imagination  of  French  mathematicians.  Laplace  dis- 
cussed it  in  his  Traiti  des  probabilitys  published  in  1S12,  and 
Poisson  exhausted  it  in  his  Rcchcrckcs  sur  la  probability  des 
jugements  cn  matitre  crininclle  cl  cn  via  Here  civile,  which  ap- 
peared in  1837.  No  other  legal  problem  has  ever  received  such 
extensive  mathematical  treatment.  In  the  introduction  of  his 
Rickcrckcs  Poisson  wrote  hopefully  that  "among  the  applications 
of  this  calculus,  one  of  the  most  important  is  that  which  concerns 
the  probability  of  the  correctness  of  judgments  . . . 

The  reception  of  this  work  would,  perhaps,  have  surprised  its 
author.  Poisson’s  Richcrchcs  has  become  a cornerstone  of  mod- 
ern probability  theory,  with  an  enormous  range  of  scientific 
applications.  Its  predictive  power  has  been  demonstrated  re- 
peatedly. But  the  law  itself  has  remained  virtually  immune  to  its 
influence.  If  there  was  any  discernible  impact  on  the  French 
administration  of  justice,  the  fact  has  not  been  recorded,  and  from 
that  day  to  this  lawyers  have  made  only  slight  use  of  probability 
and  statistical  theories  although  in  certain  respects  these  branches 
of  mathematics  are  uniquely  suited  to  legal  problems.4 

1 2 U.S.C.  I 2{b)  (1954).  See  Huntington,  The  Apportionment  o f Representa- 
tives in  Congress,  30  Transactions  or  rue  Am.  Math.  Soc'y  $5  (1928),  For  a 
mathematical  approach  to  reapportior.meru  see  Weaver  & Hess,  A Procedure  for 
Ronpartisan  Districting:  Development  of  Computer  Techniques,  73  Yaie  L.J. 
:S3  (1963)* 

* See,  e.g.,  Schubert,  Judicial  Attitudes  end  Voting  Behavior:  The  sg6t  Term  of 
the  United  States  Supreme  Court,  2S  Law*  & Cos  temp.  Pros,  ico  (1963).  Unlike 
the  mathematical  analysis  of  lejal  issues  which  is  proposed  hue,  mathematical 
prediction  frequently  involves  consideration  of  legally  irrelevant  facts  (such  as  the 
type  of  court  in  which  the  case  arises)  which  are  found  to  have  a statistical  cor- 
relation with  the  result. 

*S.  Poisson  at  1-3  (transl.). 

4 There  has  teen  some  effort  (much  of  it  mistaken)  to  use  statistical  theory  in 
connection  with  the  so-called  duplication  prob’em.  See  50  Mins.  L.  Rev.  745 
(1966). 
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Statistical  decision  theory  is  relevant  to  Jury  discrimination 
cases  because  legal  principles  in  these  cases  are  the  intuitive  equiv- 
alents of  ideas  used  in  probability  theory.  This  does  not  imply 
that  legal  decisions  can  be  the  products  of  mathematical  proofs; 
the  differences  between  legal  and  mathematical  reasoning  preclude 
any  such  merging  of  the  two  sciences.  It  does  mean,  however,  that 
in  the  more  difficult  jury  discrimination  cases  an  informed  judicial 
decision  cannot  be  made  without  mathematical  analysis  of  the 
underlying  data. 

This  article  will  present  various  methods  of  applying  mathe- 
matical techniques  to  problems  of  jury  discrimination,  using  as 
examples  cases  involving  racial  discrimination  L the  selection 
of  veniremen  and  grand  jurors.  The  same  techniques  can  be 
applied  to  investigate  discrimination  on  the  basis  of  other  factors, 
such  as  economic  status  or  sex.  A rather  detailed  mathematical 
treatment  is  presented  here  because  it  now  seems  clear  from  the 
fate  of  other  efforts  that  mathematical  techniques  are  unlikely  to 
receive  judicial  acceptance  until  there  is  a much  fuller  under- 
standing of  the  meaning  of  statistical  methods  and  the  nature 
of  their  relationship  to  the  relevant  legal  issues.5 

I. 

For  more  than  seventy-five  years  it  has  been  the  law  that  a 
conviction  in  a state  court  violates  the  equal  protection  clause 
of  the  fourteenth  amendment  if  it  is  based  on  an  indictment  of 
a grand  jury  or  a verdict  of  a petit  jury  from  which  Xegroes  were 

•Attempts  to  apply  statical  methods  to  these  p/oblems  have  been  marred 
by  cryptic  explanation  and  have  r.ot  met  with  judicial  acceptance.  Billingsley  v. 
Clayton,  359  F.:d  13  (5th  Cir.  1956) , Brie:  lor  Appellants  at  53-54  (probability  of 
observed  number  of  Xegroes  in  th?  jury  box  :*3:d  to  be  less  than  one  in  a billion  in 
one  division  and  less  than  one  in  :o  million  in  a second  division) ; United  Slates  v. 
Hoffa,  196  F.  Supp.  25  (S  D.  Fla.  i jSi),  Reply  Memorandum  for  Defendants  at  S 
(MTbe  statistical  probabilities  of  random  selection”  made  the  absence  of  women  ‘a 
virtual  impossibility”).  A far  more  elaborate  effort  was  made  in  United  States 
v.  Van  Allen,  jo3  F.  Supp.  33:  (S.D  X.V.  1962).  The  defendants  presented  testi- 
mony by  a statistician  and  a sociologist  to  the  effect  that  the  jury  commissioners 
in  the  Southern  District  of  New  York,  discriminated  against  the  poor.  The  court 
rejected  the  contention.  Unfortunately  the  presentation  was  cryptic  because  the 
Statistician  did  not  reveal  the  basis  for  his  conclusions. 

It  has  been  suggested  that  probability  theory  might  be  used  to  predict  or  ex* 
plain  the  outcome  of  jury  discrimination  cases.  Ulmsr,  Sitprem  Coin  Be'tivhr  in 
Racial  Excl’tifon  Ccscs:  56  Am.  Pol.  Set.  Rev.  325  (igSi).  The  method 

presented  here  is  intended  not  to  reduce  pad  holdings  to  a mathematical  standird 
but  to  provide  courts  with  a new  method  of  analyzing  evidence. 
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excluded  because  of  their  race.8  Discrimination  must  of  course 
be  proved.1  In  the  initial  cases  in  which  the  principle  emerged, 
problems  of  proof  did  not  arise  because  the  fact  of  discrimination 
was  never  put  in  issue.  Negroes  were  excluded  by  statute;  9 the 
fact  of  exclusion  was  uncontested; 0 or  the  principle  was  tested 
solely  on  the  pleadings.10  In  three  early  cases,  discrimination 
was  denied  by  the  state,  but  the  defendants  failed  to  produce 
evidence  to  support  their  claims." 

The  current  type  of  case  first  appeared  in  1934  with  the  Su- 
preme Court  decision  in  Norris  v.  Alabama}-  There  the  Court 
faced  for  the  first  time  a disputed  issue  of  discrimination  in  the 
selection  of  grand  and  petit  jurors  and  a record  containing  evi- 
dence purporting  to  bear  on  the  issue.  The  evidence  on  the  mo- 
tion to  quash  the  indictment  was  that  although  Negroes  comprised 
7j4%  of  the  total  adult  male  population  of  Jackson  County, 
where  the  petitioners  were  indicted,  no  Negroes  in  the  recollec- 
tion of  witnesses  had  ever  served  on  any  grand  or  petit  jury.  This 
single  fact,  the  Court  held,  made  out  a "prima  facie  case  of  the 
denial  of  the  equal  protection  which  the  Constitution  guaran- 
tees.”11 The  prirria  facie  case  was  “supplemented,”  the  Court 
noted,  by  evidence  of  the  presence  of  at  least  thirty  qualified 
Negroes  in  the  county  and  by  the  testimony  of  the  jury  com- 
missioners that  Negroes  had  not  been  included  on  the  jury  roll 
because  their  names  were  "never  discussed.” 

The  evidence  on  the  motion  to  quash  the  trial  venire  was  that 
although  Negroes  comprised  about  iS%  of  the  adult  male  popu- 
lation in  Morgan  County,  where  the  petitioners  were  tried,14  and 
although  many  were  qualified,  no  Negro  within  the  memory  of 
witnesses  had  ever  served  cn  a jury  or  been  called  for  service. 
The  county  sheriff  was  unable  to  identify  any  names  of  Negroes 


•The  seminal  cases  are  Strauder  v.  West  Virginia,  10a  US.  303  (tSSo),  and 
Neal  v.  Delaware,  103  U.S.  3; 0 (iS3o).  Legislation  designed  to  enforce  thesa 
rutings  has  recently  been  rejected  by  the  Senate,  in  Cong.  Rec.  35,091-117  (daily 
ed.  Sept.  1966). 

•Tarrance  v.  Florida,  r3S  US.  519  (1903). 

•Bush  v.  Kentucky,  107  U.S.  no  (1SS3). 

•Neal  v.  Delaware,  103  US.  370  (rSSi). 

‘•Carter  v.  Texas,  177  US.  432  (150a). 

“Martin  v.  Texas,  200  U.S.  316  (t;si);  Srov.n5.eM  v.  South  Carolina,  139 
US.  436  (1903);  Smith  v.  Mississippi,  rSa  U.S.  392  (tSg5). 

‘*391  US.  587  (i9JJ>. 

11  Id.  at  59 1. 

“The  triat  was  transferred  to  Morgan  County  because  of  asserted  prejudice 
against  the  defendants  bn  Jackson  County. 
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on  the  jury  roll.  The  Court  held  the  general  denials  of  discrimina- 
tion by  a member  of  the  jury  board  insufficient  “to  rebut  the 
strong  prima  fade  case  which  the  defendant  had  made.”  15 

The  issue  in  Norris  was  whether  the  particular  grand  and  petit 
juries  which  indicted  and  convicted  the  petitioners  were  selected 
without  discrimination,  No  testimony  was  introduced  pertaining 
specifically  to  the  selection  of  either  jury;  instead,  discrimination 
was  proved  from  the  racial  statistics  of  jury  selection  over  a num- 
ber of  years.  Statistical  evidence  of  this  type  can  be  relevant  in 
different  ways  to  the  existence  of  discrimination  in  the  choice  of 
a particular  venire.  The  absence  of  Negroes  from  other  venires 
selected  from  the  same  jury  roll  may  be  evidence  that  there  are  no 
Negroes  on  that  roll.  The  practice  of  the  jury  commissioners  in 
excluding  Negroes  from  other  venires  may  be  evidence  of  their 
intent  to  exclude  Negroes  from  the  venire  in  question.50  The 
unvarying  failure  of  different  groups  of  commissioners  to  select 
any  Negroes  may  be  evidence  of  a governing  custom  of  discrim- 
ination. Whatever  the  relevant  inference,  the  deduction  of  dis- 
crimination rests  on  the  idea  that  the  absence  of  Negroes  from 
the  jury  roll  or  from  jury  venires  is  evidence  of  racial  exclusion 
somewhere  in  the  selection  process  when  Negroes  constitute  a 
substantial  segment  of  the  population. 

The  facts  in  Norris  made  it  easy  for  the  Court  to  accept  this 
idea.  The  absence  of  Negroes  from  juries,  jury  venires,  and  the 
jury  roll,  and  the  direct  testimony  of  the  commissioners  that 
Negroes  were  never  considered,  combined  to  present  a clear  case 
of  discrimination.  The  facts  in  other  cases  in  which  the  Court 
has  granted  relief  have  been  equally  extreme.17  But  since,  on  the 
Court’s  reasoning,  the  Constitution  prohibits  any  discrimination  in 
selection,  it  was  inevitable  that  the  ideas  used  by  the  Court  in 
Norris  would  be  tested  in  cases  where  the  evidence  was  less 
blatant.  Challenges  were  made  in  cases  where  Negroes  had  ap- 
peared on  juries,  although  in  proportions  smaller  than  their 
proportion  in  the  population.  In  such  underrepresentation  — as 

,J  394  US.  at  593. 

,#In  Brown  v.  Alien,  54}  U.S.  445,  47-3  (1953),  the  Court  noted  that  "past 
practice  (a  evidence  of  past  attitude  oi  mind."  It  found  such  evidence  not  decisive 
because  there  had  been  a purge  and  refilling  of  the  jury  boa  before  the  jury  in 
question  was  selected. 

11  E-t-,  Arnold  v.  N'orth  Carolina,  37a  U.S.  773  ((964)  (one  Negro  served  on 
grand  jury  In  l»,  years);  Hid  v.  Tenas,  516  U.S.  4°v  ((94a)  (no  Negroes  served 
or.  grand  jury  fo;  at  least  16  years). 
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opposed  to  exclusion  — cases,  the  Supreme  Court  has  yet  to  hold 
that  the  disparity  was  sufficient  to  show  discrimination.' H 

In  all,  five  to  eight  underrepresentation  cases  have  come 
before  the  Court. !n  There  is  promise  of  many  more.  In  addition 
to  a larger  flow  of  cases  involving  criminal  defendants,  an  increas- 
ing number  of  class  actions  has  been  brought  on  behalf  of  Negroes 
to  secure  the  right  to  serve  on  juries."'1  The  plaintiffs  In  these 
cases  seek  not  to  set  aside  a conviction  but  to  restructure  the 
selection  system.  Inasmuch  as  the  claims  in  these  cases  involve 
many  juries,  the  proof  by  necessity  is  statistical.  The  statistics 
usually  show  not  exclusion  but  underrepresentation,  thus  testing 
the  principles  of  the  exclusion  cases  in  an  important  institutional 
setting.01 

Dallas  County,  Texas,  was  the  source  of  the  first  two  under- 
representation cases.  The  stage  was  sot  in  194a  when  the  Supreme 
Court,  in  Hill  v,  Texas ^ reversed  the  murder  conviction  of  a 
Negro  because  Negroes  had  been  totally  excluded  irom  grand 
juries,  Including  the  grand  jury  which  indicted  him.  Following 
this  decision,  a Negro  appeared  on  each  of  the  next  two  grand 
jury  lists,  and  one  served  on  the  grand  jury  which  indicted  Robert 
Akins,  a Negro,  for  the  rape  of  a white  woman.  Akins  appealed  his 
conviction  to  the  Supreme  Court,  claiming  that  the  number  of 
Negroes  serving  on  the  grand  jury  which  indicted  him  had  been 
limited  by  design  to  cue.  In  Akins  -j,  Texas21  the  Court  rejected 
this  claim.  It  computed  that  “on  the  strictly  mathematical  basis 

c/.  Anderson  v.  Alabama,  3f,6  U.S.  2cS  (igOt),  rex' 'g  ptr  curiam  270 
Ala.  575,  1:0  So.  26  397  (1960),  where  both  umlv  representation  cn  venires  and 
exclusion  from  grand  juries  wc;e  alleged. 

19  Aldr.s  v.  Texas,  3:5  U.S.  393  (19*5);  Cassell  v.  Texas,  339  U S.  2 32  (1950) ; 
Brown  v.  Allen  and  Speller  v.  Allen,  344  U S.  i\s  (195?) ; Swain  v.  Alabama,  3S0 
US.  20:  (19^5).  The  number  is  ei^Vit  i?  Smith  v.  Texas,  ju  U.S.  12$  (1040), 
Barksdale  v.  Louisiana,  247  La,  153,  170  So.  :d  37;  ( 19X4 ) ( ce/t.  denied,  j3j>  U.S. 
921  (1965),  and  Anderson  v.  Alabirm,  36J  U-5.  :c  > (19  ;i),  are  included.  Cole- 
man v.  Alilumr,  377  U.S.  1:9  (1964),  his  beer*  re  rum  Led  far  hearing  on  the 
merits  of  the  underrepresentation  issue.  Si, -ns  v.  ::i  Ca.  igo,  144  S.E.rd 

103  (1965),  cert.  granted,  384  U.S.  95$  is  per.  ilng. 

30  £ f White  v.  Crook.  251  F,  Supp,  4:1  XM.D.  Ala.  1960)  (jury  box  required 
to  be  emptied  and  refilled). 

71  See,  e i , Mitchell  v.  Johnson,  F.  Supp.  n;  (M  D Ala.  1960),  where  the 
number  of  white  persons  on  the  jury  roll  represented  55*3  of  the  eligible  white 
population  and  the  number  0;  Xearces  represented  only  9 7*  of  the  Ne.;ro  popula- 
tion. The  court  held  that  this  evidence  demonstrates  “wide  di  proportions**  and 
that  this,  “without  mere,  requires  an  inference  of  systematic  cj.chisiaa  on  rn;ul 
grounds  . . . .” 

12  316  U.S.  (1512). 

" Jjj  e.S.  i^5  (1543). 
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of  population,  a grand  jury  of  twelve  would  have  1.S552  Negro 
members  on  the  average/’  apparently  deriving  this  figure  by 
multiplying  the  Negro  percentage  of  the  population  by  the  num- 
ber of  jurors.  Since  there  had  been  one  Negro  member  on  each 
jury  list,  and  since  one  served  on  the  jury  that  indicted  Akins,  the 
Court  concluded  that  “we  cannot  say  that  the  omission  from  each 
of  the  two  lists  of  all  but  one  of  the  members  if  a race  which 
composed  some  fifteen  per  cent  of  the  population  alone  proved 
racial  discrimination.” 01 

During  the  five  and  one-half  years  after  the  Hill  decision,  there 
were  twenty-one  grand  juries  in  Dallas  County.  It  is  consistent 
with  the  implied  invitation  in  Akins  that  one  Negro  each  appeared 
on  seventeen  grand  juries  and  none  on  the  remaining  four.  These 
statistics  were  scrutinized  by  the  Supreme  Court  in  Cassell  v. 
Texas' 5 the  third  of  the  Dallas  County  cases.  Cassell,  a Negro 
convicted  of  murder,  charged  discrimination  and  relied  on  the 
above  statistics  and  on  the  testimony  of  the  jury  commissioners 
to  prove  it.  Since  there  had  been  only  seventeen  Negroes  of  the 
252  members  on  the  twenty -one  grand  juries,  only  6.7 % of  the 
jurors  were  Negroes.  The  Court  observed,  however,  that  the 
discrepancy  between  this  figure  and  the  15.5/0  Negro  population 
was  explicable,  because  Negroes  constituted  only  6.5^  of  the 
county  poll-tax  payers  and  payment  of  the  poll  tax  was  a pre- 
requisite for  jury  service,  Since  6.7 Kc  was  the  racio  of  Negroes 
actually  sitting  on  the  juries,  the  Court  concluded  that  “without 
more  ft  cannot  be  said  that  Negroes  had  been  left  off  the  grand 
jury  panels  to  such  a degree  as  to  establish  a prima  facie  case  of 
discrimination/’ :o 

The  distribution  of  Negroes  presented,  in  the  Court's  view,  a 
different  question.  If  it  was  true  that  the  jury  commissioners  had 
limited  their  number  to  one  on  each  jury,  this  action  would  have 
been  unconstitutional  sin.ee  “jurymen  should  be  selected  as  in- 
dividuals . . . and  not  as  members  of  a race/,2T  But  Mr. 

at  405-06.  Apart  from  the  Sup::.".cijTIty  of  the  mathematical  argument, 
the  decision  might  not  have  caused  corded-: a.  but  the  Court  added  a cat*: at  to  its 
opinion  which  is  wholly  inexplicable  in  the  light  oi  former  rulings  that  race  may 
not  be  conf'dertd  in  the  * Section  process:  “This  conclusion  makes  it  unnecessary 
to  decide  whether  a purposeful  limits* r r.  0:  jurors  by  race  to  the  approximate  pro- 
portion that  the  eligible  jurymen  of  the  race  so  United  bears  to  the  total  eliglV.es 
Is  Invalid  under  the  Fourteenth  Arr.^nJxent."  Id.  at  407. 

,5S39  U.S.  :3c  (1950). 

,4M.  at  235-55.  This  b a classic  example  of  the  fai’urc  of  intuition  to  compre- 
hend the  signiacance  of  data. 

V.S.  at  :3S. 
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Justice  Reed’s  opinion  for  the  Court,  concurred  in  by  three  other 
■Justices,  avoided  resting  decision  on  the  ground  that  the  appear- 
ance of  a single  Negro  on  each  of  seventeen  juries  was  evidence 
of  limitation;  instead,  it  reversed  because  the  commissioners 
testified  that  they  took  only  people  they  knew  and  admitted  they 
did  not  know  any  Negroes.  “When  the  Commissioners  were  ap- 
pointed as  judicial  administrative  officials,  it  was  their  duty  to 
familiarize  themselves  fairly  with  the  qualifications  of  the  eligible 
jurors  of  the  county  without  regard  to  race  and  color.  They  did 
not  do  so  here,  and  the  result  has  been  racial  discrimination.”  2* 

Although  the  decision  in  Cassell  was  based  on  other  grounds, 
the  case  was  the  first  in  which  the  Court  accepted  an  explanation 
for  a disproportionately  small  Negro  representation  on  juries. 
The  issue  reappeared  a few  years  later  in  two  cases  reported  to- 
gether from  North  Carolina,  Brown  v.  Allen  and  Speller  v.  AUckP 
The  Court  then  proved  willing  to  accept  such  explanations.  Brown 
and  Speller,  Negroes,  were  convicted  in  the  North  Carolina  courts 
of  rape  and  were  sentenced  to  death.  Their  challenges  to  the 
grand  and  petit  juries  were  based  solely  on  statistical  evidence. 
In  Brown’s  case  this  showed  that,  although  Negroes  comprised 
33.5%  of  the  adult  population  of  the  county,  in  the  year  of  his 
indictment  and  the  previous  year  only  to  icyt  of  those  chosen 
for  grand  jury  service  were  Negroes;  in  the  year  he  was  indicted 
the  percentage  of  Negroes  on  petit  jury  panels  ranged  from  9 to 
17%.  The  Court  stated  that  “variations  in  proportions  of  Negroes 
and  whites  on  jury  lists  from  racial  proportions  in  the  population 
have  not  been  considered  violative  of  the  Constitution  where  they 
are  explained  and  not  long  continued.”™  It  concluded  that  the 
disparity  had  been  explained.  The  names  of  prospective  jurors 
were  drawn  from  poll  and  property  tax  lists,  and  Negroes  ap- 
peared to  constitute  a much  smaller  percentage  of  such  lists  than 
of  the  general  population  — although  how  much  smaller  a per- 
centage did  not  appear  from  the  evidence.  The  decision  below 
was  affirmed. 

In  Speller’s  case,  jurors  were  selected  from  the  tax  lists.  Thirty- 
eight  percent  of  the  taxpayers  were  Negroes,  but  only  7%  of  the 
persons  selected  by  the  clerk  for  the  jury  box  were  Negroes.  The 
Court  held,  however,  that  the  discrepancy  could  be  explained  by 
the  testimony  of  the  clerk  that  he  took  those  witn  “the  most 

w Id.  at  ti 9. 
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property.”  It  concluded  that  “evidence  of  discrimination  based 
solely  on  race  In  the  selection  actually  made  is  lacking.” 31 

The  apparent  reluctance  of  the  courts  lo  consider  underrepre- 
sentation as  evidence  of  discrimination  has  been  confirmed  by  two 
recent  cases,  which  will  be  considered  in  some  detail  in  Part  III. 
In  the  first  case,  Sivaln  v.  Alabama,*2  Swain,  a Negro,  was  con- 
victed of  rape  in  Talladega  County,  Alabama,  and  sentenced  to 
death.  The  Supreme  Court  summarized  the  evidence  on  the  panels 
as  follows:  “White  Negro  males  over  21  constitute  26^0  of  all 
males  in  the  county  in  this  age  group,  only  10  to  15%  of  the  grand 
and  petit  jury  panels  drawn  from  the  jury  box  since  1953  have 
been  Negroes,  there  having  been  only  one  case  in  which  the  per- 
centage was  as  high  as  23%.”  33  In  Swain's  case  there  were  four 
or  five  Negroes  on  the  panel  of  thirty-three  from  which  lus  grand 
jury  was  selected  and  eight  Negroes  on  the  panel  of  one  hundred 
from  which  his  petit  jury  was  selected.  The  Court  held  that  these 
statistics  did  not  constitute  prima  facie  evidence  of  discrimina- 
tion: “We  cannot  say  that  purposeful  discrimination  based  on 
race  alone  is  satisfactorily  proved  by  showing  that  an  identifiable 
group  in  a community  is  underrepresented  by  as  much  as  10 fo. 
. . . The  overall  percentage  disparity  lias  been  small,  and  re- 
flects no  studied  attempt  to  include  or  exclude  a specified  number 
of  Negroes.”  31  As  we  shall  demonstrate,  this  result  is  inconsistent 
with  the  announced  principles  governing  these  cases. 

Swain  also  attacked  the  petit  jury  which  had  convicted  him, 
claiming  that  the  prosecution  had  used  its  peremptory  strikes  to 
exclude  Negroes.  The  evidence  showed  that  the  six  Negroes  avail- 
able for  service  in  Swain's  case  had  been  struck  by  the  prosecu- 
tion. The  Court,  three  Justices  dissenting,  rejected  the  claim.  It 
held  that  in  any  given  case  it  is  presumed  that  the  prosecution  has 
used  its  peremptory  challenges  to  secure  an  impartial  jury  and 
that  this  presumption  is  not  overcome  by  allegations  that  “in  the 
case  at  hand  all  Negroes  were  removed  . . . because  they  were 
Negroes.”35  In  the  Court’s  view  the  case  would  have  been  differ- 

51  Id , at  481. 

”3So  U.S.  302  (1565). 

M Id.  at  305. 

34  Id.  at  20S-09.  The  dcceptiven:55  of  this  statement  was  called  to  the  Court’* 
attention.  Brief  of  the  American  Civil  Liberties  Union  as  Amicus  Curbs  Li  Sup- 
port or  Petition  for  Rehearing  is  I?  93^  of  the  population  and  3oy?  of  the 

average  panel  is  Negro,  the  disparity1  any  be  insignificant.  This  \s  not  so 

If  ror*  of  the  population  is  Negro  b'jt  r.o  panelists  are  Negro. 

**380  US.  at  2:2. 
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ent,  and  the  fourteenth  amendment  claim  would  have  taken  on 
added  significance,  if  defendant  had  shown  that  the  prosecutor  “in 
case  after  case,  whatever  the  circumstances,  whatever  the  crime 
and  whoever  the  defendant  or  the  victim  may  be,  is  responsible  for 
the  removal  of  Negroes  who  have  been  selected  as  qualified  jurors 
by  the  jury  commissioners  and  v,-ho  have  survived  challenges  for 
cause  . . . 30  But  although  Negroes  had  never  served  on  petit 

juries  in  Talladega  County,  Swain  had  failed  to  prove  his  case 
because  he  had  not  shown  that  the  prosecution  had  been  respon- 
sible in  each  case  for  striking  Negroes. 

The  Court  has  thus  drawn  the  line  between  a peremptory  strike 
based  on  a belief  that  a Negro  juryman  would  be  biased  in  a par- 
ticular case  and  the  systematic  use  of  peremptory  strikes  to  keep 
Negroes  off  juries.  The  first  is  permitted,  the  second  unconstitu- 
tional.37 

In  the  other  recent  case,  State  v.  Barksdale,**  a Negro  was  con- 
victed of  rape  and  sentenced  to  death.  Tne  evidence  showed  that 
Negroes  in  Orleans  Parish  composed  about  ggfc  of  the  adult 
male  population  but  only  io % to  1 6%  of  the  grand  jury  venires 
and  ri%  to  10%  of  the  petit  jury  venires.  From  May  195S,  when 
the  Supreme  Court  reversed  a murder  conviction  in  the  parish  on 
the  ground  of  exclusion  of  Negroes  from  the  grand  jury,”  until 
September  196a,  when  Barksdale  was  indicted,  nine  grand  juries 
had  been  selected.  Eight  of  these  contained  two  Negroes  each, 
and  one  Negro  served  on  the  ninth.* *"  On  this  evidence  the 
Supreme  Court  of  Louisiana  affirmed,  holding  that  the  smaller 
proportion  of  Negroes  serving  on  grand  juries  was  adequately  ex- 
plained by  their  deficiencies  in  education  (which  resulted  in  the 
qualification  of  fewer  Negroes)  and  income  (which  led  more 
Negroes  to  request  hardship  excuses).*1  The  Supreme  Court 
denied  certiorari.  The  denial,  of  course,  docs  not  mean  that  the 
Court  approved  the  state  court’s  holding.  I shall  try  to  show 
that,  when  put  to  a mathematical  test,  the  facts  in  Barksdale  point 
to  a contrary  decision. 

The  opinions  of  the  Court  in  underrepresentation  cases  leave 
little  doubt  that  disparity  between  the  proportion  of  Negroes  on 
venires  and  in  the  population  generally  is  evidence  of  dis- 

J®  Id.  at  jsj. 

*7 See  .Vote,  Fcir  Jury  Selection  Procedures,  75  Yale  L.J.  .1 : : (1965). 

” 147  La.  t$3,  r;o  S')  id  374  (1954),  Ctrl,  denied,  j3i  US.  9:1  (1 9S3). 

**  Eubanks  v.  Louisiana,  33$  U S.  3S4  ( : 19 5 a > . 

M 147  La.  at  :ij,  170  So.  :d  at  3S3. 

41  Id.  at  aaj-n,  t;o  So.  :d  at  3S4. 
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criminatory  'election  if  it  is  large,  continuing,  and  unexplained. 
But  it  is  also  clear  that  a.  defendant  is  not  entitled  to  a venire  or 
a jury  on  which  members  of  his  race  are  proportionately  repre- 
sented, and  efforts  to  create  proportional  representation  t;. 
systematically  including  Negroes  have  been  held  uncon-i'- 
tional."  Consequently,  not  every  disparity  in  proportions  is 
constitutionally  fatal.  “The  mere  fact  of  inequality  in  the 
number  selected  does  ret  in  itself  show  discrimination. ''  " 

The  refusal  of  the  Court  to  grant  relief  in  Steam  has  thrown 
into  sharp  relief  the  p'cbkrn  of  drawing  a line  between  significant 
and  insignificant  disparities.  Counsel  in  Such:  asserted  flatly 
that  “the  State  must  surely  offer  some  explanation  of  why  tire 
proportion  of  Negroes  on  grand  and  petit  jury  venires  averages 
at  most  one-half  the  proportion  of  eligible  Negroes  in  the 
population.""  But  neither  Court  nor  counsel  offered  a rational 
way  of  determining  w hich  disparities  verc  large  enough  to  be  con- 
sidered evidence  of  design.  As  a resin  , counsel  could  do  little 
more  than  assert,  without  reasoned  support,  that  the  statistics 
tequired  a finding  of  discrimination.  And  when  the  Court  reached 
a contrary  conclusion  in  on  ;*«,  the  basis  for  its  decision  remained 
equally  Delphic. 

In  the  exclusion  cases  the  Court  has  consistently  refused  to 
countenance  the  explanation  that  Negroes  were  omitted  from 
juries  because  they  were  uaqvili* ed.  The  explanation  arose  again 
in  the  underrepresentation  cases,  this  time  in  more  compelling 
form,  since  all  that  hud  to  b?  asserted  was  that  some  but  not  all 
Negroes  were  unqualified  In  Ccudl,  Brown,  and  Speller,  the 
Court  accepted  such  assertions,  refusing  to  find  discrimination 
on  the  basis  of  disparities  in  f sreentages.  In  Caste!!  the  evidence 
showed  that  the  average  props -ti.-n  of  Negroes  on  grand  juries 
was  at  least  approximately  equal  to  their  proportion  on  the  tax 
rolls;  but  in  Brew:  ar.  l >/  '.  /■  the  evidence  showed  only  that 
the  number  of  cligb-e  Negro-:?  wa-  to  some  undetermined  extent 
smaller  than  their  rrmier  in  the  ad. At  male  population.  On  xvhat 
theory  the  Court  found  r«n quantitative  explanations  sufficient 
does  not  appear. 

The  failure  of  the  Court  to  articulate  a rationale  for  the  under- 
representation  cases  is  a ccr.s:quer.ce  of  the  problem  they  present. 

"Cl,  C*r.-I  v.  Wit.t-.  : ■ F ici,  tfd  •*!  ntwi^i,  F :t  »?r  f 
Cif  K ttrt  yt$  i*  $ <i§*iV 
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In  the  exclusion  cases  the  disparity  is  gross  enough  for  the  intui- 
tive conclusion  that  jurors  were  chosen  on  the  basis  of  -.ace.  But 
the  notion  that  sc  me  Negroes  should  have  been  chosen  is  too  im- 
precise to  offer  guidance  once  some  Negroes  are  in  fact  chosen. 
Since  the  judges  have  recognized  that  not  every  variation  from 
proportionate  numbers  is  evidence  of  design,  they  arc  confronted 
with  the  harder  problem  of  drawing  a line  between  significant  and 
insignificant  variations  and  the  related  problem  of  determining 
the  effect  to  be  given  to  •’explanations"  for  significant  variations. 

The  resolution  of  these  problems  involves  important  questions 
of  legal  policy  which  the  Court  has  never  reached  because  the 
legal  issues  have  remained  intertwined  with  statistical  ones.  We 
need  not  more  tender-minded  Justices  or  more  appealing  verbal 
formulations  but  an  analysis  of  the  data  wh:ch  will  expose  the 
true  legal  issues. 


II 

Probability  theory  has  been  found  to  apply  to  events  commonly 
called  "random”  or  “chance."  The  result  of  a coin  toss  is  the 
classic  example.  Theoretically,  at  least,  the  person  tossing  the 
coin  might  determine  the  outcome  in  advance  by  calculating  the 
forces  acting  on  the  coin.  Put,  since  minute  changes  in  some  of 
these  forces  can  produce  a different  outcome,  there  is  for  all 
practical  purposes  no  way  of  predetermining  the  Outcome.  Con- 
sequently, the  result  is  treated  as  independent  cf  the  volition  of 
the  actor.  It  is  this  practical  independence  of  result  from  cause 
that  characterizes  phenomena  suited  to  the  application  of  prob- 
ability theory.41 

It  Is  obvious  that  the  racial  outcome  of  jury  selection  is  — or 
rather  should  be  — independent  of  the  method  of  selection.  This 
Is  true  rot  because  the  selecting  agent  is  unab’e  to  control  the 
result,  as  in  the  coin-tossing  case,  but  because  he  is  rot  legally 
permitted  to  do  so.  But  whether  the  outcome  is  independent  be- 
cause of  the  physical  impossibility  of  connecting  it  with  its  causes 
or  because  of  the  unconstitutionaiity  of  making  such  a connection, 
the  result  is  the  same:  the  racial  outcome  is  random. 

This  is  not  to  say  that  the  method  of  selection  for  the  jury  roll 
must  be  neutral  with  respect  to  the  number  cf  whites  and  Negroes 
considered.  The  use  of  various  qualification  tests,  such  as  literacy, 

•’Tfct  *f  fiffset  frcJictt--  ts  i'-StM  tvs*,  i.i  !v.*sry  betj-j't  cf 
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or  the  operation  of  the  selection  machinery,  for  instance  the  use 
of  telephone  listings,  may  result  in  the  qualification  or  considera- 
tion of  a disproportionately  small  number  of  Negroes.  But  within 
the  pool  of  eligibles,  however  restricted  in  size,  the  selections  must 
still  be  made  without  reference  to  race,  and  this  means  that  the 
racial  population  of  jury  venires  is  — or  should  be  — the  result 
of  random  selection  and  consequently  an  appropriate  subject  for 
probability  theory  analysis. 

Basing  itself  on  probability  theory,  statistical  decision  theory 
provides  us  with  a variety  of  tests  for  the  hypothesis  that  the 
number  of  Negroes  chosen  for  venires  is  consistent  with  random 
selection.  Like  all  statistical  tests,  these  are  not  infallible,  but 
their  rates  of  error  can  be  made  extremely  small  and  are  mathe- 
matically determinable.  There  is,  as  we  shall  show,  judicial  prece- 
dent for  deciding  cases  with  a rate  of  error  similar  to  the  rate  of 
error  present  in  the  tests  we  propose  to  use. 

The  Supreme  Court  has  used  an  approach  in  principle  identical 
to  this  when  holding  that  the  persistent  absence  of  Xegroes  from 
juries  is  evidence  of  discrimination.  Although  die  basis  for  de- 
cision has  usually  not  been  articulated,  the  Court  has  indicated 
on  a few  occasions  that  its  holdings  rest  on  the  improbability  of 
repeated  random  selection  of  all  white  venires  in  counties  with 
Substantial  Negro  populations.  Thus,  in  Smith  r.  Texts,  the 
Court  said:  “Chance  and  accident  alone  could  hardly  have 
brought  about  the  listing  for  grand  jury  service  of  so  few  negroes 
from  among  the  thousands  shown  by  the  undisputed  evidence  to 
possess  the  legal  qualifications  for  jury  service.’’ 4* 

Although  stated  as  a single  rule,  the  Court’s  theory  in  these 
cases  involves  two  propositions.  The  first  is  that,  ff  the  number 
of  Negroes  chosen  is  an  improbable  result  of  random  selection, 
that  fact  is  prima  facie  evidence  of  discrimination.  This  approach 
justifies  our  use  of  probability  and  statistical  analysis  in  these 
cases.  As  we  shall  see,  improbability  may  be  demonstrated  in  a 
Dumber  of  different  ways.  The  Court  has  explicitly  sanctioned  one 
of  these  wa>s  in  its  second  proposition:  a large  and  unexplained 
disparity  between  the  proportion  of  Negroes  on  venires  and  in 
the  population  generally  is  unlikely  to  result  from  random  selec- 
tion. This  approach  justifies  the  mathematical  model  which  we 
shall  use  to  determine  the  probabilities  in  these  cases. 

The  mathematical  test  which  can  be  used  to  determine  the 
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existence  of  discrimination  i$  based  on  the  probability  that,  as 
veniremen  are  selected,  any  gis'en  venireman  will  be  a Xegro. 
This  probability  is  usually  derived  from  the  proportion  of  adult 
Negro  males  to  the  total  adult  male  population.  In  each  jury  case, 
however,  it  has  been  urged  that  these  population  figures  are  mis- 
leading and  that  the  absence  of  Negroes  from  venires  or  juries 
reflects  a lack  of  literacy,  of  integrity,  or  of  other  qualifications 
required  of  jurors.  Statistical  methods  can  be  fruitfully  used  here 
to  determine  the  percentages  of  Negroes  and  whites  it  would  be 
tcccssiry  to  assume  disqualified  in  order  to  account  for  the  ob- 
served numbers  of  Negroes  on  venires.  Whether  these  percentages 
are  consistent  with  the  education,  status,  and  general  qualifica- 
tions of  the  Negro  and  white  papulations  is  a question  which  the 
Court  can  approach  with  some  confidence  since  it  resembles  other 
factual  issues  underlying  constitutional  claims. 

The  Court  has  taken  a similar  but  fionquar.titative  approach 
to  the  qualifications  issue  in  the  cases  v e have  discussed.  In  the 
exclusion  cases,  the  contention  that  the  absence  of  Negroes  from 
juries  could  be  explained  by  their  lack  of  qualifications  was  re- 
jected because  it  involved  the  assumption  (so  the  Court  believed) 
that  virtually  the  entire  Negro  population  was  unqualified.4* 
"(A]  race  (cannot]  be  proscribed  as  incompetent  for  service.”  M 
Similar  reasoning  appears  in  the  underrepresentation  cases.  In 
Speller  v.  Allen,**  for  example,  the  Court  held  that  a discrepancy 
of  thirty-one  points  between  the  percentage  of  Negroes  on  venires 
and  the  percentage  on  tax  lists  from  which  the  venires  were 
drawn  could  not  be  explained  by  the  clerk's  disqualification  of 
Negroes  for  poor  moral  character.  "It  would  not  be  assumed,” 
the  Court  held,  “that  in  Vance  County  there  is  not  a much  larger 
percentage  of  Negroes  with  qualifications  of  jurymen.”  * The 
same  method  of  reasoning,  using  mathematical  techniques,  will 
be  adopted  here. 

In  certain  situations  it  is  impossible  to  determine  whether  the 
number  of  Negroes  appearing  on  a series  of  venires  or  juries  is 
consistent  or  inconsistent  with  the  qualifications  of  the  Negro 
population.  This  may  occur  when  there  is  inadequate  evidence 
concerning  qualifications  or  when  persons  selected  for  service  may 
be  excused  on  grounds  other  than  lack  of  qualifications.  Those 
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called  for  grand  jury  service,  for  instance,  may  be  excused  on 
grounds  of  economic  hardship.  Another  example  Is  the  appear- 
ance of  Negroes  on  petit  juries  (as  distinguished  from  jury 
panels).  Although  in  Su-ain  the  Supreme  Court  stated  that  the 
prosecution  may  not  use  peremptory  strikes  to  exclude  Negroes 
systematically,  there  remains  a broad  area  of  discretion  in  the 
use  of  such  strikes  or  challenges  which  makes  it  virtually  impos- 
sible to  determine  from  population  statistics,  however  carefully 
refined,  the  probability  that  a Negro  will  appear  on  a petit  jury. 

While  in  such  cases  it  is  r.ot  possible  to  use  a statistical  analysis 
based  on  the  failure  to  select  a greater  number  of  Negroes,  it  is 
possible,  at  least  in  the  case  of  venires  and  grand  juries,  to  de- 
termine whether  the  number  of  Negroes  selected,  which  I shall 
call  the  distribution  of  Negroes,  is  consistent  with  randomness. 
To  take  a crude  example,  the  appearance  of  a series  of  grand 
juries  alternately  ico'/c  and  ofc>  Negro  would  excite  our  sus- 
picions as  to  the  selection  process  whatever  the  probability  of 
selecting  a Negro  or  white  juror.  Statistical  decision  theory  pro- 
vide;- a measure  of  the  extent  to  which  a given  distribution  differs 
from  the  theoretical  or  expected  random  distribution  and  a test 
of  the  significance  of  such  difference.  The  test  will  indicate, 
within  mathematically  determined  limits  of  accuracy,  whether 
the  nature  of  the  observed  distribution  is  so  at  variance  with  th  ; 
expected  distribution  that  the  hypothesis  of  random  selection 
ought  to  be  rejected. 

Although  the  Court  has  rover  applied  this  line  of  reasoning,  the 
test  of  distribution  operates  on  the  same  principles  as  the  test 
based  on  a comparison  of  percentages  which  it  has  used.  Perhaps 
the  closest  thing  we  have  to  judicial  recognition  that  a skewed 
distribution  may  be  evidence  of  discrimination  Is  the  statement  of 
the  three  concurring  Justices  in  Cassell  v.  Texas : *l 

The  number  of  Negroes  both  qualified  and  available  for  fury  service 
in  Dallas  County  precluded  such  uniform  presence  of  never  more 
than  one  Negro  on  any  other  bas;s  oi  good  faith  than  that  the  com- 
missioners were  guided  by  the  belief  that  one  Negro  on  the  grand 
jury  satisfied  the  prohibition  against  discrimination  .... 

In  concluding  that  probability  theory  can  usefully  be  applied 
to  these  problems,  it  has  been  assumed  that  the  Intuitive  idea  of 
probability  used  by  the  Court  is  conceptually  similar  to  the 
mathematical  definition  used  by  the  statisticians.  Although  the 
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premise's  of  the  legal  argument  remain  obscure,  the  assumption  is 
a reasonable  one,  since  the  intuitive  content  of  the  mathematical 
definition  is  the  common  idea  of  probability.  Thus  the  mathe- 
matical notion  of  an  event  with  a probability  of,  say,  o.t  corre- 
sponds to  the  practical  notion  of  an  event  that  occurs  on  the 
average  of  once  in  every  ten  trials/1 

III 

We  now  apply  the  methods  outlined  above  to  Swain  v.  Alabama 
and  Barksdale  v.  Louisiana.  The  first  case  involves  the  selection 
of  N'cgroes  for  venires;  the  second,  their  appearance  on  grand 
juries. 

A.  An  Analysis  cj  the  Selection  o]  Veniremen 

In  Swain,  it  will  be  recalled,  there  was  a claim  of  discrimination 
in  the  selection  of  the  grand  arid  petit  jury  venires  and  in  the 
selection  of  petit  jurors.  We  begin  our  examination  with  a general 
mathematical  analysis  of  venire  selection.  The  application  of 
these  methods  to  the  facts  in  Swain  will  be  confined  to  the  grand 
jury  venires,  since  the  record  with  respect  to  the  petit  jury  venires 
is  not  sufficiently  clear  to  nermil  the  use  of  mathematical  methods. 

The  process  of  selecting  veniremen  can  be  treated  as  what  is 
known  in  probability  theory  as  a scries  of  Bernoulli  trials.11  Re- 
peated trials  are  called  Bernoulli  trials  if  (/)  there  are  only  two 
possible  outcomes  for  each  trial,  (a)  the  tria’s  are  independent, 
and  (j)  the  probabilities  for  each  outcome  remain  constant 
throughout  the  trials.  The  repeated  tossing  of  a win  is  a series 
of  Bernoulli  trials  because  there  are,  theoretically,  only  two  out- 
comes for  each  toss,  a head  or  a tail,  the  tosses  are  independent 
in  the  sense  that  the  outcome  of  one  does  not  affect  the  outcome 
of  another,  and  the  probability  of  tossing  a head  or  a tail  remains 
constant  throughout  any  series  of  losses.  Simiinrily,  when  venire- 
men are  selected,  there  are  only  two  racial  results  possible,  white 
and  nonwhite  (we  equate  the  latter  whh  Negro),  the  result  of 
each  selection  is  independent  of  other  selections,  and  it  is  reason- 
able to  assume  that  the  probibilties  of  selecting  a Xegto  or  a white 
remain  constant  throughout  ti  e selection  process. 

Strictly  speaking,  selecting  veniremen  is  not  a series  of 
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Bernoulli  trials  because  each  person  selected  is  eliminated  at 
least  for  a certain  time  from  the  pool  of  potential  jurors.  Theo- 
retically, therefore,  the  probability  of  selecting  a white  or  a 
Negro  on  any  given  trial  will  be  dependent  cn  the  history  of  selec- 
tion to  the  extent  that  the  racial  proportions  of  the  available 
population  are  changed  by  the  elimination  of  those  already  chosen. 
Although  it  is  possible  to  take  this  into  account  in  computing 
probabilities,  it  is  not  necessary  to  do  so.  The  number  of  venire- 
men selected  will  in  most  cases  change  the  racial  proportions  of  the 
available  population  loan  insignificant  degree.  Moreover,  In  these 
cases  whites  are  usually  selected  with  a frequency  which  equals 
or  exceeds  their  proportion  of  the  population.  The  assumption 
that  the  proportion  of  white  and  Negro  eligible*  remains  constant 
is  conservative  since  the  proportion  of  white  cligibles  actually 
declines  during  the  selection  process.  Assuming  that  the  propor- 
tion remains  constant  makes  an  improbably  large  percentage  of 
white  veniremen  seem  slightly  more  in  accord  with  random  selec- 
tion than  is  really  the  case.  For  these  reasons,  we  adopt  the 
simplification  that  the  selection  process  consists  of  independent 
trials  with  constant  probabilities  applicable  to  each  trial. 

The  derivation  of  the  probabilities  associated  with  Bernoulli 
trials  rests  on  two  basic  propositions.  The  first  is  the  sum  rule, 
which  states  that  the  probability  of  the  occurrence  of  any  one  of 
a number  of  mutually  exclusive  events  is  equal  to  the  sum  of  the 
probabilities  o'  <ose  events. s*  The  second  is  the  product  rule, 
which  states  t -i  the  probability  of  the  joint  occurrence  of  a 
Dumber  of  independent  events  is  equal  to  the  product  of  the  indi- 
vidual probabilities.”  Discussion  of  these  elementary  proposi- 
tions may  be  found  in  textbooks  on  probability  or  statistics.” 

Applying  the  product  rule,  if  P is  the  probability  of  selecting 
a single  Negro  venireman,  the  probability  of  selecting  two  Negro 
veniremen  is  P X P,  or  Pi.  More  generally,  the  probability  of 
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selecting  v Negro  veniremen,  where  v is  any  number,  is  P’.  Sim- 
ilarly, if  Q is  the  probability  of  selecting  a single  white  venire- 
man,57 then  the  probability  of  selecting  w white  veniremen  is 
QJ.  If  Q ~ as  it  may  if  whites  comprise  approximately  three- 
quarters  of  the  eligible  population,  and  if  there  arc  thirty  venire- 
men on  a panel,  the  chance  of  picking  a single  panel  without  any 
Negroes  is,  by  the  product  rule/*  Q(*n>  - 30)  = (^)M  = 0.0002. 
This  means  that  on  the  average  only  two  venires  in  ten  thousand 
would  be  all  white.  If  every  venire  were  all  white  or  all  Negro 
our  analysis  could  stop  here.  But,  since  in  Sarin  and  most  other 
cases  both  whites  and  Negroes  were  in  fact  chosen,  some  further 
analysis  is  necessary  to  derive  the  relevant  probabilities. 

If  P is  the  probability  of  selecting  a single  Negro  and  Q the 
probability  of  selecting  a single  white,  PQ  is,  by  the  product  rule, 
the  probability  of  selecting  a Negro  and  a white  in  that  order. 
The  order  of  selection,  however,  is  unimportant.  The  condition 
of  selecting  a Negro  and  a while  will  also  be  satisfied  by  first 
selecting  a while  and  then  selecting  a Negro.  The  total  proba- 
bility of  selecting  a Negro  and  a white  without  regard  to  order 
is  thus  PQ  -}•  QP  - 2 PQ.  In  general,  the  probability  of  select- 
ing a particular  mixture  of  Negroes  and  whites  is  the  probability 
of  selecting  that  mixture  in  a particular  order  times  the  number 
of  different  ordeis  in  which  the  mixture  may  bo  selected.51 

If  n is  the  number  of  men  on  a venire,  the  probability  of  select- 
ing v Negroes  is  Pr  and  the  probability  of  selecting  n - v whites  is 
Since  these  events  are  independent,  their  product,  P'Q 
Is  the  probability  of  selecting  v Negroes  and  n - v whites  in  a par- 
ticular order.  The  second  clement  is  the  number  of  ways  this 
mixture  may  be  selected.  The  number  of  distinguishable  order- 
ings of  n objects  of  which  v are  of  one  class  and  » - v of  another 
class  is:  M 
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These  two  elements  (the  probability  for  each  ordering  and  the 
number  of  distinguishable  orderings)  multiplied  together  yield 
the  well-known  "binomial  distribution,"  which  expresses  the  prob- 
ability P(v)  of  selecting  v Negroes  out  of n veniremen: 


The  challenge  to  the  selection  system  we  are  discussing  hero 
must  be  based,  not  on  the  probability  of  selecting  a particular 
number  of  Negroes  (which  is  what  the  above  formula  gives  us), 
but  on  the  probability  of  selecting  not  more  than  i Negroes.  By 
the  sum  rule,  the  probability  of  selecting  not  more  than  i Negroes 
b the  sum  of  the  probabilities  of  selecting  o,  i,  a , i Negroes. 
This  sum  is  expressed  symbolically  as  follows: 


With  this  preparation,  we  arc  ready  to  consider  the  grand  jury 
venires  in  Su-jl/t.  The  tecord  shows  that  for  about  ten  years 
Negroes  had  never  accounted  for  more  than  ij/c  of  the  grand 
jury  venires,  with  the  single  exception  of  a 13^0  Negro  venire 
chosen  in  the  summer  of  1955.  These  venires  consisted  of  ap- 
proximately 30  persons  each,  and  three  to  four  were  chosen  every 
year M Census  figures  showed  that  adult  Negro  males  constituted 
approximately  s6£o  of  the  adult  male  population.  For  purposes 
of  discussion  we  assume  that  between  1955  and  196a  thirty  venires 
were  chosen  consecutively  with  five  or  fewer  Negroes  each,  and 
that  between  1953  and  1955  five  additional  venires  were  setected 
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one  of  which  was  23^0  Negro.  We  assume  further  that  Negro 
males  constituted  one-quarter  of  the  adult  male  population. 

Applying  the  binomial  distribution  previously  derived,  the 
probability  that  not  more  than  five  Negroes  would  appear  on  any 
thirty-man  venire  is: 


On  the  assumption  that  the  probability  P of  selecting  a Negro 
venireman  is  one-quarter  and  the  probability  Q of  selecting  a 
white  venireman  is  thus  three-quarters  the  above  expression  be- 
comes: *J 


This  result  means  that  about  one  venire  in  five  should  have  five 
or  fewer  Negroes. 

In  Swcin,  however,  venires  with  five  or  fev%er  Negroes  appeared 


applying  the  product  rule,  is  o.j?J  - 4 .63  X ro“!1.  This  means 
that,  on  the  average,  only  one  ir.  more  than  one  hundred  million 
trillion  groups  each  containing  thirty  venires  would  consist  solely 
of  venires  which  were  not  more  than  15'c  Negro.  If  thirty  jury 
venires  were  selected  at  random,  in  Talladega  County  every  day 
of  the  year,  the  daily  selection  would  correspond  to  the  facts  in 
Sicahi  only  one  day,  on  the  average,  in  thousands  of  trillions  of 
years/* 1 
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thirty  that  tn  th*  *timp?e  of  the  tut  * = 5;  t » v 4*  1 » No  jj. 
Tfce  table  shawl  the  take  ht  = • ; 9?'*  «K:h  subtracted  frorfl 

1 oecco  ykVJs  P«'*e£$)  * e 20:$r  St Jtutirrwnn  fre-jenti/  compute  the  vitae  ci 
F(r)  b»  *«ir?  i ppfvt;m t ti :M  r.'ch  a?  tta  er  torn u*  :^>iba:Jow.  F’ t 

W,  Fnm,  ,\5  I.vteootxtwx  to  Fa-imam  t*uc«Y  a:  0 li s .Vm*CAt»\s  14? 
(:d  ed.  s 9 J ?) - 

TV*  tbit  thirty  utb  vtrrts  - ere  t>  >>:n  in  1 to^v  ttrtlvu  or.ty 

ftnerd  support  from  the  record  since  it  Is  n;t  d:if  etictly  her*  rainy  *ere 
c Vo  sen.  Record  at  to.  Bat  c or,  eta**  -i  :<-r:in  tfcbixs  a broii 

tiru»;on  In  ftamtart.  If  orly  twenty  vtr.;?:«  tttre  chosen  iritnl  of  thirty,  the 
prohibit)  thit  at  »c«!d  ha\e  £vt  or  (r r;r  Ntpori  woahJ  be  the  ire^ijiYt 
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If  all  the  Swabs  venires  selected  since  1953  are  Included,  the 
total  rises  1035,  and  one  venire  was  23^  Negro.  Although  the  nu- 
merical result  is  not  significantly  changed  by  considering  this 
data,  we  shall  do  so  to  illustrate  an  extension  of  the  method. 

We  seek  the  probability  of  the  occurrence  of  thirty-four  venires 
with  five  or  fewer  Negroes  and  one  venire  with  seven  or  fewer 
Negroes.  This  can  be  expressed  as  the  sum  of  the  probability  of 
thirty-five  venires  with  five  or  fewer  Negroes,  which  is  P( v ** 
5),J,  and  the  probability  of  thirty-four  venires  with  five  or  fewer 
Negroes  and  or.e  venire  with  six  or  seven  Negroes.  This  latter 
probability,  using  the  binomial  distribution,  is  35V34I1I  P(v 
5) 11  P(6  ^ v 7).  The  sum  of  these  two  probabilities  is: 

P(t)  *=  P{v  ^ j)»  4-  T^r(v  £?  j)»«  P( 6 eS  « *s  7) 

*=  (o.»)”+(3$)  (0.2 )*'  (0.3) 

» 1.85  X io~!J 

This  is  about  twenty-five  times  smaller  than  the  probability  com- 
puted without  the  additional  data. 

It  is  instructive  to  apply  these  methods  to  Cas*cU  v.  Texas*' 
(although  that  case  involved  jurors  and  not  veniremen)  where 
Information  concerning  the  grand  juries  was  more  precise  than  in 
Swain.  Of  the  twenty-one  grand  juries  considered  in  Cassell,  a 
single  Negro  appeared  on  seventeen  and  none  on  the  remaining 
four.  Negroes  comprised  6.3^  of  the  eligible  population.  The 
Couil  held  that  these  statistics  did  not  constitute  evidence  of 
discrimination  since  the  average  number  of  Negroes  on  the  juries 
when  considered  as  a group  was  6.7 fc.  This  did  not  appear  to 
differ  significantly  from  their  proportion  in  the  population  of 
eligible?.  Mr.  Justice  Reed  and  the  three  Justices  who  joined 
In  his  opinion  did  not  base  their  reversal  of  the  conviction  on  the 
improbability  that  a series  of  juries  would  be  selected  at  random 

&ot  accept  this  Interpretation,  and  the  trd^M  of  the  evidence  fcctcj  to  favor  the 
Court's  understanding  (Ml  iris  the  r:itiwn  propCflios  of  Negroti.  $ft  jtl 
05-  *1  taj.  Why  petitioner's  course!  adapted  t>L«  kii  fivorab’e  vie*  of  the  e\i- 
derxe  Is  mystifying.  Vet  ever  the  probability  that  venires  would  average  tp^e 
l?e|to  U minute.  If  %t  i-ain  a?«Brr.t  the  selection  of  ventres  irith  jo  members 
each  cut  of  the  pea  veniremen  s:tuted*  15/#  er  tjj  ire  taVen  to  hive  been  Nrarcei. 
While  the  etict  tosputUkva  of  the  p*ohabiHty  of  tvU  occurrence  ao*M  be 
laborious,  its  smiHreii  is  ifdkitNl  by  the  fact  Utt  the  of  thoosirj 

tat  teore  thin  ijc  NVfrcel  is  !e?s  thin  o.eoc-oi.  Sft  Hiiva).!  CoiiettAtrw  L>t- 
Of.UDir,  ft,V4  rote  Oi,  it  raj.  The  p*bbabC?ty  of  thoos2.-.g  net  more  tin  :j; 
Kejroei  be  lab-tittu^y  kii  tbit  ihii,  ihk«i  lir^f  lSi«n  tbit  coxpa*:d 

on  the  Court's  assumption. 

<4  m 0*5  :Sj  (195c). 


Michael  0.  Finklestein 


in  which  the  proportion  of  Negroes  never  exceeded  to  any  large 
degree  their  proportion  in  the  population.  On  the  assumption  that 
P *=0.065  and  Q — 0.935,  and  using  the  formulas  previously  de- 
rived, we  have  P(v  *£  1 ) = 0.S1S.  The  probability  that  not  more 
th3n  one  Negro  would  appear  on  each  of  ;i  venires  (a  better  dis- 
tribution than  that  actually  observed  since  there  were  four  venires 
without  Negroes)  is  0.S1S  :'  = o.oi5.  Whether  this  result 05  sup- 
ports  Justices  Frankiurtcr,  Burton,  and  Minton,  v.ho  grounded 
then  t ...currencc  on  the  improbability  that  ro  more  than  one 
Negro  would  ever  appear  on  a venire,  depends  on  the  statistical 
test  used  to  evaluate  the  probabilties. 

Our  computation  of  (he  probabilities  in  5vc;«  was  based  on  the 


assumption  that  the  selection  of  veniremen  corresponded  to  a 
series  of  Bernoulli  trials  with  a probability  P - } 4 of  selecting  a 
Negro  venireman.  On  this  assumption  there  was  apparently  only 
a minute  probability  that  so  few  Negroes  would  be  selected. 
Since  an  event  with  such  an  apparently  small  probability  did 
occur,  we  are  led  to  reject  the  basis  on  which  the  probabilities 
were  computed,  namely,  the  assumption  that  venire  selection  in 
Swain  consisted  of  Bernoulli  trials  with  th?  probability  P = }$ 
of  selecting  a Negro  venireman.  This  reasoning  is  formalized  as 
the  following  statistical  test: 

The  assumption  thet  the  rectal  result  of  ’centre  selection  corre- 
sponds to  a series  of  Bernoulli  trials  uitfi  probability  P - \\  of 
Selecting  c single  Xcgro  inure  men  is  rejected  lohiU.c’.cr  the  prob- 
ability of  selecting  not  more  than  the  obsentd number  of  Scgrocs 
an  mures  is  less  then  a ccrto^i  critical  %'alue.  For  purposes  of 
illustration  the  critical  value  P(t)  = 0.05,  the  value  most  com- 
monly used  by  statisticians,  is  selected. 

Since  the  probability  of  selecting  net  more  than  the  observed 
number  of  Negroes  on  venires  in  Suain  was  vastly  smaller  than 
o.o$,  application  of  the  test  leads  us  to  teject  the  hypothesis  that 
the  selection  of  veniremen  in  Stxin  corresponded  to  the  assumed 
series  of  Bernoulli  trials.  Rejecting  the  hypothesis  that  venire- 
men were  selected  at  random,  with  the  probability  of  selecting  a 
Negro  r^ttal  to  the  proportion  of  Negress  in  the  population,  docs 
not  in  itself  imply  discrimination  in  the  selection  process.  Tech- 
nically speaking,  all  we  kno.v  is  tha'  the  assumed  Bernoullian 
model  with  P *>  \\  is  not  consistent  with  the  observed  facts.  To 


'’Ike  t u'i  Itrt  tusnthi  till  rm>  { U ft**:.  Cun 
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determine  whether  there  is  discrimination  we  must  consider  the 
alternative  hypothesis  that  P does  not  equal  J4-  One  fourth  of 
the  adult  males  in  Talladega  County  were  Negroes.  But  if  — as 
the  Court  was  apparently  willing  to  believe  — the  jury  commis- 
sioners considered  a larger  proportion  of  Negro  adult  males  than 
white  adult  males  to  be  unacceptable  for  jury  service,  P would  be 
less  than  l/j,,  This  would  only  be  unconstitutional  discrimination 
if  the  commissioners  applied  higher  standards  for  Negroes  than 
for  whites  or  if  they  excluded  from  their  consideration  a substan- 
tial number  of  qualified  Negroes.6®  Since  the  absence  of  qualified 
Negroes  is  the  only  constitutionally  valid  reason  for  accepting  a 
smaller  value  for  P,  the  qualifications  issue  which  lurked  in  the 
wings  is  now  on  stage. 

The  statute  governing  jury  selection  in  Talladega  County, 
which  is  not  untypical,  provides  that  a juror  must  be  a male 
citizen,  generally  reputed  to  be  honest  and  intelligent,  esteemed 
in  the  community  for  integrity,  good  character,  and  sound  judg- 
ment, over  the  age  of  twenty-one,  not  a habitual  drunkard,  able 
to  read  English  (unless  he  is  a freeholder),  and  never  convicted 
of  an  offense  involving  moral  turpitude.67  The  Court  pointed  out 
in  Swain  that  the  method  used  to  find  qualified  jurors  need  not 
be  perfect  in  the  sense  that  every  segment  of  the  population 
receives  an  equally  scrupulous  canvass.6* * **  The  nature  and  range 
of  qualifications  required  of  jurors  and  the  latitude  allowed  in 
searching  out  veniremen  combine  to  make  the  qualifications  issue 
a substantial  problem. 

Analysis  of  this  problem  requires  a reversal  of  our  former  pro- 
cedure. Before,  the  probability  P(v)  of  observing  not  more  than 
v Negroes  on  the  venires  was  calculated  from  P,  the  probability 
of  selecting  a single  Negro  venireman.  We  now  seek  a value  for 
P which  will  yield  a value  for  P(v)  which  is  equal  to  or  greater 
than  the  critical  value  0.03  and  would  thus  by  our  test  result 
in  the  acceptance  of  the  Bernoullian  hypothesis.  In  other  words, 
if  it  is  contended  that  P is  really  less  than  l/\ , v;e  will  inspect 
other  values  of  P to  determine  how  small  it  must  become  before 
P(v ) would  satisfy  the  statistical  test.  P thus  determined  is  the 
largest  proportion  of  eligible  Negroes  to  the  total  eligible  popula- 


66  Cassell  v.  Texas,  3313  U.S.  :S:  (1930).  Bui  see  Rabinowitz  v.  United  States, 
3S  U.S.L.U'.  ao5i  (sth  Cir.  July  :o,  1956)  (en  banc)  (federal  jury  lists  must  be 
“fair  cross-section”  of  community). 

•T  Ala.  Code  tit.  30,  } :i  (:93s). 

**380  U.S.  at  ac3,  :o,). 
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tion  which  it  is  possible  to  assume  and  still  obtain  a probability 
for  the  observed  results  which  is  not  less  than  the  critical  value 
0.05. 

In  making  these  determinations  it  13  mathematically  possible 
to  reverse  things  and  to  express  P as  a function  of  P{v).  This, 
however,  involves  some  technical  problems  and  denies  us  the  use 
of  tables  for  computing  the  binomial  distribution.  It  is  sufficient 
for  our  purposts  to  compute  P(v)  for  diminishing  values  of  P 
until  the  point  appears  at  which  P(v)  passes  the  critical  value  of 
0.05.  Table  I relates  differing  values  of  P to  P(v)  on  the  assump- 
tion that  P(v)  = P(v  =£  5) 30. 


TABLE  I« 


Probability  of 
SeUcting  a Single 
Negro  Venireman 

P 

Probability  of 
Selecting  a Single 
Venire  of  30  with 
5 or  Fesver  Negroes 

P(v^s) 

Probability  of 
Selecting  30 
Such  Venires 

P(v^s)  3I> 

.25 

.20260 

4.63  X 10-11 

.20 

.42751 

8.4777  X 10~12 

.15 

.71058 

3.539  X 10-3 

.12 

.85692 

0.0097 

.11 

.89509 

0.03593 

.10 

.92681 

0.1002 

.05 

.99672 

0.9053 

.02 

.99997 

0.9911 

The  table  shows  that  the  value  P(v)  passes  from  less  than 
0.05  to  more  than  0.05  as  P passes  from  o.rr  to  o.ro.  Conse- 
quently, in  order  to  obtain  a value  of  P(v)  which  meets  the  test 
of  the  Bernoullian  hypothesis  P would  have  to  have  a value  less 
than  0.11.  For  simplicity  it  is  assumed  that  the  critical  value  is 
P - o.ro;  this  is  accurate  enough  for  present  purposes.  In  other 
words,  we  cannot  account  for  the  observed  number  of  Negroes 
on  venires  unless  we  assume  that  eligible  Negroes  constitute  Yio 
or  less  of  the  total  eligible  population. 

Since  Negroes  comprise  approximately  25%  of  the  adult  male 
population,  there  are  three  times  as  many  adult  male  whites  as 
adult  male  Negroes.  On  our  assumption  that  no  more  than  10% 
of  the  eligibles  are  iSegroes,  there  must  be  at  least  nine  times  as 
many  eligible  whites  as  eligible  Negroes.  This  means  that,  to 
satisfy  the  statistical  test  established  for  the  Bernoullian  hypoth- 

19  This  table  was  computed  as  described  iii  note  6j  supra. 


Michael  0.  Finklestein 

esis,  it  must  be  assumed  that  tlse  proportion  of  adult  male 
whites  qualifying  is  at  least  three  times  as  large  as  the  proportion 
of  adult  male  Negroes  qualifying.  For  example,  if  all  the  whites 
qualify,  then  not  more  than  one-third  of  the  Negroes  may  qualify; 
if  75%  of  the  whites  qualify,  then  not  more  than  25%  of  the 
Negroes  may  qualify. 

Qualifications  for  jury  service  may  be  divided  roughly  into 
objective  and  subjective  factors.  The  most  important  objective 
factor  is  literacy;  among  the  subjective  factors  are  a reputation 
for  honesty  and  intelligence,  community  esteem,  and  so  forth. 
The  Court  should 1-  \ first  at  census  and  other  data  to  determine 
the  impact  of  the  objective  factors.  If  these  do  not  account  for 
the  three-to-one  qualification  rate  in  favor  of  whites,  then  the 
effect  of  the  subjective  factors  must  be  considered. 

The  Census  Bureau  has  not  reported  how  many  Negroes  in 
Talladega  County  have  completed  six  years  of  school  (and  are 
thus  presumably  literate70),  but  the  statewide  average  census 
figures  show  that  approximately  48  % of  adult  male  Negroes  and 
83%  of  adult  male  whites  have  had  such  schooling.71  If  this 
average  applies  to  Talladega  County,  the  greater  rate  of  illiteracy 
among  Negroes  cannot  by  itself  explain  their  absence  from  the 
venires,  since  the  comparative  literacy  ratio  is  not  three-to-one 
but-  less  than  two-to-one.  Assuming  that  literacy  is  the  only  sig- 
nificant objective  factor,72  the  three-to-one  qualification  rate  must 
be  owing  in  part  to  the  application  of  the  subjective  criteria. 

An  eligible  venireman  must  be  literate  and  must  also  satisfy 
the  subjective  criteria.  We  must  now  determine  the  proportion 
of  the  adult  male  literates  of  each  race  qualifying  on  subjective 
grounds.  Since  there  are  three  times  as  many  whites  as  adult 
Negroes,  and  since  83%  of  the  whites  and  48%  of  the  Negroes 
are  literate  (using  the  sixth  grade  schooling  test),  the  ratio  of 
adult  literate  whites  to  adult  literate  Negroes  is: 

3X83% 

1 X 48%  5 ■ 

T0TbLs  b the  standard  (or  presumptive  literacy  used  la  the  Voting  kights  Act 
of  iv<5$,  41  U.S.C.  $3  1971(c),  i97Jb(e)  (Supp,  I,  1965). 

Tl  Department  or  Commerce,  Bureau  07  Census,  VS.  Ctvsi/s  or  Pofula- 
tion:  19601  vo!.  x,  pt.  3 (Alabama).  Talladega  County  Negroes  over  the  age  of 
twenty -five  are  more  literate  than  adult  Negroes  In  the  state  as  a whole,  since 
their  median  schooling  is  6.5  years.  Id.  at  :-:or,  2-334, 

TaTbis  imp.ies  that  conviction  of  an  offense  Involving  moral  turpitude  and 
habitual  drunkenness,  the  two  other  objective  factors  mentioned  by  the  Alabama 
statute,  do  not  markedly  affect  the  relative  proportions  of  Negroes  and  whites 
qualifying. 
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Thus,  when  adult  males  emerge  from  the  literacy  test  there  are 
about  five  times  as  many  whites  as  Negroes.  But  we  know  that, 
after  the  literacy  and  subjective  tests,  there  are  at  least  nine  times 
as  many  whites  as  Negroes.  We  know,  therefore,  that  whites  are 
almost  twice  {%)  as  successful  as  Negroes  in  getting  through 
the  subjective  tests.  In  other  words,  the  proportion  cf  literate 
Negroes  qualifying  on  subjective  grounds  is  % of  the  corre- 
sponding proportion  of  white  literates. :i 

The  qualifications  issue  has  now  been  refined  to  the  point 
where  the  problems  of  legal  policy  are  exposed  for  the  Court  to 
deal  with  unencumbered  by  subsidiary  statistical  questions.  The 
principal  legal  question  is  whether  a state,  by  imposing  subjective 
criteria  for  jury  service  or  by  its  method  of  administering  the 
system,  may  act  in  such  a way  that  the  rate  of  qualification  among 
literate  Negroes  is  substantially  smaller  (in  our  case  by  almost 
half)  than  the  rate  among  literate  whites,  Several  resolutions  are 
possible.  The  Court  might  hold  such  a result  unconstitutional  per 
se.  It  might  view  the  result  as  prima  facie  evidence  of  discrimina- 
tion which  the  defendant  would  be  at  liberty  to  rebut.  Or  it  might 
require  the  petitioner  to  present  evidence  of  specific  malfeasance. 
Whatever  the  resolution,  these  alternatives  present  problems  of 
basic  importance  to  this  branch  of  the  law.  xhe  Court  did  not 
reach  those  problems  in  Swain  because  of  its  inability  to  assess 
the  significance  of  statistical  data  without  mathematical  tools. 

This  line  of  reasoning  may  Li  expressed  algebraically  33  follows:  If  ES/LN 
equals  the  proportion  of  eligible  Negroes  in  the  literate  Negro  population  and 
EW/LW  equals  the  proportion  of  eligible  whites  in  the  literate  white  population, 
we  are  seeking  to  determine  the  number  a such  that  EX/ LX  (o){EWfLW). 
Transposing,  a = {EN/E)Y)(L\V/LX).  We  have  determined  that  the  proportion 
of  eligible  Negroes  to  eligible  whites  {ES/EW)  must  be  no  greater  than  %.  Thus, 
at  most,  EX/EIY  = VJ.  We  have  also  determined  that  the  proportion  of  literate 
whites  to  literate  Negroes  {LV//LX)  is  approximately  5.  Combining  the  two: 
a = (EX/E\V)(LW/LN)  = (U)<5)  the  result  given  in  the  text. 

When  we  examined  the  representation  of  Negroes  on  venires  without  consider- 
ing the  qualifications  issue,  the  statistics  were  so  extreme  that  it  mad* *  little  dif- 

• ference  which  assumptions  or  methods  were  used.  This  is  no  longer  true  when 
dealing  with  the  qualifications  issue.  If  twenty  rather  than  thirty  venires  v/ere 
assumed  chosen  consecutively,  P wuuM  be  closer  to  0.12.  The  simultaneous  use  of 
other  assumptions  favorable  to  the  state  cnuM  raise  this  figure  even  higher.  On 
the  other  hand,  we  have  omitted  information  which  cuts  toe  other  wa>,  the 
testimony  that  during  thu  period  there  were  three  0.  four  grand  juries  without 
cny  Negroes.  Record  on  Appeal  at  10.  It  is  not  necessary  for  our  purposes  to 
work  out  the  numerical  consequences  of  the$s  possibilities.  We  do  rote  that  our 
results  are  sensitive  to  cur  assumptions,  and  that  i:  the  Court  had  reached  the 
qualifications  issue,  it  might  reasonably  have  decided  that  more  accurate  Informa- 
tion was  essential  for  decision. 
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It  is  appropriate  to  consider  the  significance  of  using  the  critical 
value  0.05  in  the  foregoing  analysis.  The  size  of  the  critical 
value  affects  the  probability  of  error.  When  will  a test  with  a 
critical  factor  of  0.05  lead  to  wrong  results?  In  considering  this 
question,  statisticians  usually  divide  error  into  two  types.  Type  I 
error  is  the  error  of  rejecting  the  hypothesis  of  random  selection 
when  it  is  true.  When  the  critical  value  0.05  is  used,  this  type  of 
error  will  occur,  on  the  average,  not  more  than  one  time  in  twenty 
— on  the  one  occasion  in  twenty  when  a series  of  Bernoulli  trials 
yields  the  observed  distribution.  Only  the  state  has  standing  to 
complain  of  type  I error  since  it  involves  a finding  of  discrimina- 
tion when  there  is  none. 

Type  II  error  is  the  error  of  accepting  the  random  selection 
hypothesis  when  it  is  false.  The  extent  of  such,  error  is  more 
difficult  to  determine.  However,  only  the  defendant  has  standing 
to  complain  of  type  II  error  since  it  involves  a finding  of  non- 
discrimination when  in  fact  there  is  discrimination.  When  a court 
determines  with  the  selected  test  that  discrimination  is  present, 
the  only  error  it  need  consider  is  type  I error.  It  is  only  when  no 
discrimination  is  found  that  type  II  error  need  be  considered. 

It  should  be  noted  that  on  the  facts  in  Swain  a substantially 
smaller  critical  value  (and  thus  substantially  greater  accuracy) 
does  not  change  the  results  very  much.  Thus  if  the  critical  value 
were  0.01  then  P would  equal  about  0.12  74  and  the  rate  of  dis- 
qualification for  the  subjective  factors  would  be  recomputed  as 
This  is  not  materially  different  insofar  as  the  legal  issues  are 
concerned  from  the  % rate  which  was  computed  using  the  critical 
VfVlue  0.05. 

Are  a critical  value  of  0.05  and  a type  Terror  of  one-in-twenty 
too  large?  This  is  a legal  issue  for  which  there  can  be  no  firm 
answer.  The  Court  must  weigh  the  risk  of  intruding  on  a state’s 
selection  process  without  cause  against  the  risk  of  upholding 
venires  which  were  in  fact  chosen  discriminatorily.  An  0.05  test, 
however,  seems  consistent  with  another  case  decided  by  the 
Court.  In  A very  v.  Georgia  7S  defendant  was  con •••t" ted  by  a jury- 
selected  from  a panel  of  sixty  veniremen.  The  veniremen  were 
chosen  from  a box  containing  tickets  with  names  of  the  persons 
on  the  jury  roll  — yellow  tickets  for  Negroes  and  white  tickets 
for  whites.  Five  percert  of  the  tickets  in  the  box  were  yellow, 
but  not  a single  yeilo-.v  ticket  was  selected  for  the  venire  of  sixty. 

74  5ft  Tibli  I,  p.  3 5 1 sufn. 

t,34S  U.S.  559 
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The  Court  held  that  the  all-white  venire  and  the  opportunity  to 
discriminate  presented  by  the  yellow  tickets  constituted  a prima 
facie  case  of  discrimination.  Mr,  Justice  Frankfurter,  concurring, 
concentrated  on  the  absence  of  Negroes  from  the  venire:  “The 
mind  of  justice,  net  merely  its  eyes,  would  have  to  be  blind  to 
attribute  such  an  occurrence  to  mere  fortuity.11 76 

The  statistical  case  is  hardly  that  conclusive.  The  probability 
of  selecting  a white  is  Q - 0.95.  Applying  the  multiplication  rule, 
the  probability  that  sixty  whites  would  be  selected  is  Q(w  = 60) 
- (°*95)co  - 0.046.  This  probability  is  enormously  greater  than 
that  computed  in  Swain.  The  Court’s  conclusion  in  Avery  is  ac- 
ceptable under  our  test  only  if  a critical  value  of  the  order  of  0.05 
is  viewed  as  sufficiently  small.  It  may  have  been,  however,  that 
the  Court's  intuitive  evaluation  of  the  probabilities  was  influenced 
by  its  knowledge  that  the  colored  ticket  system  furnished  a way 
to  discriminate  and  suggested  an  intent  to  do  so. 

B . An  Analysis  of  the  Selection  of  Grand  Jurors 

We  now  consider  the  issue  which  the  Court  would  have  faced 
in  Barksdale  v.  Louisiana  ,7  had  it  granted  certiorari,  namely, 
whether  a given  distribution  of  Negroes  on  grand  juries  is  con- 
sistent with  random  selection. 

For  problems  of  this  type  statisticians  frequently  use  a good- 
ness of  fit  test  known  as  the  chi-square  (x2)  test.7*  In  essence  the 
chi-square  test  as  applied  to  this  problem  is  a measure  of  the 
probability  that  the  observed  distribution  of  Negroes  on  grand 
juries  would  differ  to  the  degree  it  does  from  the  distribution 
which  would  be  most  probable  if  the  selection  of  juiors  were 
truly  random.  By  distribution,  I mean  the  proportions  of  grand 
juries  with  exactly  one  Negro,  exactly  two  Negroes,  exactly  three 
Negroes,  and  so  forth.  To  construct  the  test  one  must  assign  a 

7tld.  at  564. 

77  347  La.  1 9 3,  170  So.  id  374  (1964),  cert.  denied,  3S1  US.  93!  (1965). 

See  Robinson,  Bias,  Probability,  end  Trill  by  Jury,  15  Am.  Sociological 
Rev.  73  (1950)-  For  a nontechnical  cxphmUon  or  see  J.  Hodges  & E.  Lehman*!/, 
Basic  Concepts  of  Probability  and  Statistics  2S5-S3  (1464).  More  extensive 
discussion  may  be  found  in  R,  Fisher,  St.vumicu  Methods  for  Research 
Workers  73-iij  (13th  ed.  rev.  195S);  Fhh:r,  Thr  Conditions  Under  Which 
Measures  the  Discrepancy  Between  Obitrvcllrn  end  Hypothesis,  87  J.  Ro*al 
Statistical  Soc\  442  (19:4).  Tests  bis;.]  on  odier  factors  such  as  the  oc- 
currence of  "rum”  are  also  possible.  Ser  Divi !,  .1  xi  'Smooth9  Ten  }or  Goodness 
of  Fit,  3 4 Biometrika  :a<)  f x 047) ; S.vcd  ar.d  Ebcr.bart,  Tcblcs  hr  Testing  n* 
demness  of  Grouping  in  a Sequence  oj  Alteratives,  14  The  Annals  of  Math- 
ematical Statistics  65  C1943). 


Michael  0,  Finklestein 

measure  to  tho  difference  between  the  observed  and  expected 
distributions,  and  then  determine  the  probability  that  a distribu- 
tion with  a deviation  at  least  as  large  as  the  observed  deviation 
will  occur.  The  less  probable  such  a deviation,  the  more  we  sus- 
pect our  hypothesis.  The  theory  of  the  chi-squarc  test  i>  simple, 
although  actual  computation  of  the  probabilities  leads  to  some 
technical  problems. 

If  P(v)  is  the  probability  that  a jury  with  exactly  v Negroes 
would  be  chosen  at  random,  and  if  iV  juries  are  chosen,  the  num- 
ber of  juries  with  v Negroes  most  likely  to  be  selected  is  NP(v). 
This  is  called  the  expected  number  of  juries  with  v Negroes,  and 
is  abbreviated  as  £(v). 

We  are  interested  in  Lhe  difference  between  the  expected  and 
observed  number  of  juries  for  every  proportion  of  Negroes  from 
o °/o  to  100%.  This  difference  is  0(u)  — E(v),  where  the  ob- 
served number  of  juries  with  v Negroes  is  0(v)  and  the  expected 
number  of  such  juries  is  E(v).  If,  for  example,  the  probability 
of  selecting  a jury  with  two  Negroes  is  P {v  = 2)  = % and  four 
juries  were  chosen  we  would  expect  that  one  of  the  juries  would 
have  two  Negroes  since  E(v  = 2)  = NP(v  = 2)  = 4XK  = 1. 
If  in  an  actual  drawing  of  four  juries  none  had  two  Negroes,  then 
the  difference  between  the  number  observed  and  the  number  ex- 
pected would  be  (0  - 1)  = — 1. 

In  order  to  obtain  a total  measure  for  the  deviation  irom  the 
expected  number  of  Negroes  on  juries  the  difference  between  the 
observed  and  expected  values  is  summed  up  for  each  v from  v = o 
to  v = 12  (the  size  of  each  jury).  In  other  words  we  compare 
the  difference  between  the  observed  and  expected  number  of 
juries  with  0,  1,  2 ...  12  Negroes,  add  these  differences,  and 
obtain  a single  number  which  measures  the  totality  of  the  dif- 
ferences. Since  the  observed  values  may  be  greater  or  less  than 
the  expected  values,  these  differences  would  in  some  cases  be 
negative  (as  in  the  example)  and  thus  reduce  the  sum.  This 
would  be  misleading  because  in  this  context  it  is  irrelevant  which 
is  greater,  the  expected  or  observed  frequencies;  the  only  sig- 
nificant fact  is  that  they  are  different.  For  this  reason,  mathe- 
maticians usually  consider  the  square  of  each  difference,  which 
is  always  positive.  Finally,  in  order  to  make  the  value  of  the 
differences  proportional  to  their  expected  frequencies  each  squared 
difference  is  divided  by  its  associated  expected  frequency.™  We 


T*  If,  for  a particular  v,  0(v)  = :ji  and  E(v)  = ijo,  the  difference  between 
them  would  be  t.  But  this  is  a much  better  agreement  between  observation  and 
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thus  obtain  the  following  expression  for,\2  (chi-square)  as  the  sum 
of  the  squares  of  the  differences  between  the  observed  and  ex- 
pected frequencies  divided  by  the  expected  frequency: 


Note  that  x3  increases  as  the  difference  between  the  observed 
and  expected  frequencies  increases  and  decreases  to  zero  as  the 
sum  of  the  differences  between  the  observed  and  expected  fre- 
quencies diminishes. 

Given  a particular  distribution  of  Negroes  on  juries,  the  issue 
presented  is:  what  is  the  probability  of  obtaining  a distribution 
which  deviates  to  this  or  a greater  extent  from  the  expected  distri- 
bution? In  terms 'of  the  x terminology  this  is  equivalent  to 
asking  the  probability  of  a distribution  with  an  associated  x* 
value  which  equals  or  exceeds  the  x of  the  observed  distribution. 
The  answer  is  obtained  by  computing  the  value  of  x f°r  every 
possible  distribution,  determining  the  distributions  with  values 
of  x3  which  are  equal  to  or  greater  than  the  given  x3t  computing 
the  probability  of  the  occurrence  of  each  such  distribution  (using 
the  probability  formulas  previously  derived),  and  then  summing 
up  these  probabilities  The  resulting  sum  is  the  probability  of  the 
occurrence  of  a distribution  with  a x3  which  equals  or  exceeds 
the  x3  of  the  observed  distribution.  This  probability  is  called  the 

X probability. 

The  formula  for  x2  tests  on  the  assumption  of  a series  of  Ber- 
noulli trials  with  constant  probability  P of  selecting  a Negro.  If 
there  is  an  extremely  small  probability  of  a distribution  with  a x3 
value  equal  to.  or  greater  than  the  observed  x3  value,  doubt  is 
naturally  cast  upon  the  assumption.  The  x3  probability  should 
then  be  computed  using  other  values  of  P.  If  this  leads  to  a 
sufficiently  large  x3  probability,  the  value  of  P used  to  compute 

XI  should  then  be  compared  in  the  same  way  as  in  the  test  previ- 
ously described  with  the  evidence  of  qualifications  of  the  Negro 
community. 

The  x3  test  differs  in  an  important  respect  from  the  previous 
test.  Since  in  the  previous  test  we  sought  to  determine  the  proba- 
bility that  so  few  Negroes  would  be  chosen,  it  was  always  possible 

expectation  than  if  0(v)  = a and  E(-j)  = I.  Dividing  by  E(v)  brings  out  this 
proportionality  factor.  Thus  in  the  first  example  x*  = -^-but  in  the  second 
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to  find  a value  for  P small  enough  account  for  the  observed 
racial  population  of  the  venires.  If  very  few  Negroes  appeared 
on  venires,  P would  have  to  be  assumed  to  be  very  small, 
and  thus  inconsistent  with  the  actual  qualifications  of  the  Negro 
community.  Still,  in  every  case  some  value  of  P could  be  found 
which  made  the  observations  consistent  with  the  hypothesis  of 
randomness.  But  since,  in  dealing  with  the  x 5 test,  we  are  deter- 
mining the  probability  of  the  appearance  of  a particular  distribu- 
tion and  not  just  a maximum  number  of  Negroes,  we  cannot  in 
■ all  cases  account  for  the  observations  merely  by  reducing  the 
value  of  P.  The  value  of  P which  maximizes  the  x2  probability 
may  still  lead  to  a value  too  small  to  be  consistent  with  the  Ber- 
noullian  hypothesis. 

This  property  becomes  important  since  it  enables  us  to  apply 
the  x 2 test  to  distributions  when  P is  unknown.  In  such  cases 
we  assume  that  value  for  P which  will  lead  to  the  maximum  x2 
probability.  If  the  x 2 probability  so  computed  is  still  too  small 
t.q  be  consistent  with  the  Bernoullian  hypothesis,  then  that  hy- 
pothesis must  be  rejected  since  no  value  of  P would  lead  to  its 
acceptance.  In  other  words,  no  matter  what  we  are  willing  to 
assume  about  Negro  qualifications,  if  we  find,  say,  that  every 
grand  jury  contains  one  Negro,  neither  more  not  less,  we  may  be 
able  to  conclude  that  selection  has  not  been  random. 

These  observations  may  be  formalized  as  the  following  test: 
The  assumption  that  the  racial  result  of  grand  jury  selection  cor- 
responds to  a series  of  Bernoulli  trials  is  refected  whenever  a 
distribution  appears  for  which  the  maximum  x1  probability  is  less 
than  a selected  critical  value.  For  purposes  of  illustration  the 
critical  value  P{v)  = 0.05  is  selected.  The  same  analysis  of  type 
I and  II  error  may  be  applied  to  the  x2  test  as  to  the  previous  test. 

We  now  apply  the  x2  test  to  the  statistics  presented  to  the 
Court  in  the  petition  for  certiorari  in  Barksaale.  In  Orleans 
Parish  there  were  nine  grand  juries  selected  between  September 
1958  and  September  1962,  when  the  grand  jury  which  indicted 
Barksdale  was  impaneled.  There  were  two  Negroes  on  eight  of 
these  juries  and  one  on  the  ninth.  Each  grand  jury  liad  twelve 
members.  Negroes  comprised  one-third  of  the  adult  male  popula- 
tion of  the  parish,  but  the  Supreme  Court  of  Louisiana  held  that 
the  smaller  number  of  Negroes  on  grand  juries  was  explained  by 
their  lower  level  ot  literacy  and  by  the  fact  that  more  Negroes 
than  whites  requested  excuses  because  of  economic  hardship. 
There  was  no  evidence  as  to  the  number  of  Negroes  and  whites 
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requesting  excuses.  For  this  reason  it  is  assumed  that  P is  some 
unknown  value  less  than  l/i. 

It  will  be  recalled  that  x2  is  computed  from  the  differences 
between  t^e  observed  and  expected  numbers  of  Negroes.  De- 
termining the  observed  numbers  of  Negroes  0(v)  is  easy.  Since 
there  was  cne  jury  with  one  Negro  and  eight  juries  with  two 
Negroes,  0(i)  = r;  0(2 ) = 8;  0(v)  = o (for  all  other  v).  De- 
termining the  expected  numbers  of  Negroes  E(v)  is  more  difficult. 
By  the  formulas  previously  derived,  for  each  t*: 

E(v)  = NP(v)  = — - P*  0IS~  * 

(»,' I (l2 - *•>)! 

In  this  equation,  9 is  the  value  of  N,  the  total  number  of  grand 
juries  impaneled.  In  order  to  determine  £(v)  it  is  necessary  to 
determine  P.  Since  P is  treated  as  unknown,  we  select  a value  for 
P which  will  maximize  the  % 3 probability.  This  occurs  when  the 
value  of  xs  is  at  a minimum,  since  at  that  point  there  are  the 
greatest  number  of  distributions  with  x'  values  which  equal  or 
exceed  the  x5  of  the  observed  distribution.  In  other  words,  we 
are  looking  for  the  value  of  P which  will  yield  the  expected  dis- 
tribution most  in  accord  with  the  observed  distribution.  The 
value  of  P which  will  lead  to  minimum  y?  can  be  determined 
in  most  cases  by  using  differential  calculus.8* 

Computing  the  x2  probability  on  the  facts  here  ic  a forbidding 
task.  The  x2  value  for  each  possible  distribution  of  Negroes  and 
whites  on  juries  must  be  computed  and  the  probability  determined 
for  each  distribution  with  a x2  value  whicli  is  not  less  than  the 
X*  of  the  observed  distribution.  If  both  the  number  of  jurors 
and  the  number  of  juries  is  large  the  number  of  possible  distribu- 
tions of  Negroes  becomes  extremely  large,  and  ihe  calculation  of 
X and  the  associated  probabilities  becomes  too  long  even  for 
modern  computers  operating  within  a reasonable  span  of  time. 
Fortunately,  as  these  numbers  become  large  it  is  possible  to  ap- 
proximate the  v5  probability  by  means  of  an  approximation 
function  known  as  the  x2  distribution,  whose  values  are  widely 
available  in  tables.®’ 

The  approximation  function  cannot  be  used  with  any  confidence,  • 
however,  to  compute  the  x2  probability  in  Barksdale.  The 
sample  is  too  small  for  mis  purpose.  The  facts  in  Barksdale, 


,pIn  the  Instant  case  x1  be  a minimum  ’.vac a F = o.t6*.  v.'bfch  Is  cloae 
to  Vi  or  the  avenge  proportren  of  Negroes  on  the  nine  Juries. 

,l  £j.,  P.  Hoel,  supra  nrte  56,  app. 
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however,  permit  a simplification  which  avoids  the  need  for  a 
computer.  Since  the  number  of  Negro  jurors  remained  almost 
constant  over  the  nine  juries,  a simple  test  can  be  constructed 
based  on  a determination  of  the  probability  that  the  numbers  of 
Negroes  appearing  on  juries  would  have  been  thus  constant  if  the 
selection  of  jurors  was  random.  To  test  this  issue  using  a ver- 
sion of  the  x a approach,  we  begin  by  asking  how  many  possible 
distributions  of  seventeen  Negroes  on  the  twelve  juries  have  a 
variation  not  greater  than  the  variation  of  the  observed  distribu- 
tion. The  answer  is  clear:  the  observed  distribution  (two  Negroes 
on  each  jury  save  one  which  had  o.v')  ;•  the  most  constant  pos- 
sible; any  other  distribution  of  seventeen  would  show  a greater 
variation.  Thus,  out  of  all  possible  distributions  of  seventeen 
Negroes,  the  probability  of  observing  a distribution  which  varied 
as  little  as  that  observed  is  simply  the  proportion  which  the 
probability  of  the  observed  distribution  bears  to  the  total  probabil- 
ity that  seventeen  Negroes  would  have  been  selected  regardless 
of  distribution.** 

The  computation  of  probabilities  of  this  sort  has  already  been 
discussed  and  no  new  principles  are  in\olved.  There  were  17 
Negroes  chosen  out  of  10S  jurors.  The  assumption  most  favor- 
able to  the  state  (the  one  that  leads  to  the  largest  probability)  is 
that  P — 1 Using  this  assumption  and  the  binomial  dis- 
tribution previously  derived,  the  probability  of  selecting  a single 
jury  with  two  Negroes  is  0.295.  Upon  the  same  assumption,  the 
probability  of  selecting  a jury  with  only  one  Negro  ts  0.287.  T’:e 
probability  that  eight  juries  would  have  two  Negroes  each  and 
the  ninth  would  have  one  Negro  is,  by  the  binomial  distribution: 


(0.J93)*  (o.*S?)  « o.eooi  jS 


••the  method  used  lore  It  * if <tijl  of  4 (tit  of  the  possibility,  col 

prtsecl  In  rnost  applications  of  siaiLtkal  fasthois.  thil  the  selecting  agents  miphl 
manipulate  selection  lo  product  4 rc.-dt  whkh  do;:ly  approximated  or  equaled 
the  expected  distribution.  Testnj  for  web  practices  involves  4 compulation  of  the 
probVbUily  thil  the  observe’  distribution  tvou'i  di3er  to  such  4 small  eiter.t  from 
the  upecled  distribution.  tf  t>e  x*  lest  were  used,  the  probability  of  each  dbtriba- 
tion  miih  4 x1  equal  lo  er  less  than  the  x*  of  the  observed  distribution  mould 
have  to  be  rurr.taed,  ir.d  ii  this  iota  mere  too  snail  the  hypo  thesis  of  rir.de  n 
Selection  vta'd  oe  rejected  For  example,  th;  prcbsbiijty  of  precisely  the  txpccled 
dulribut’oa  occvirritj  is  usually  so  sni’l  lv  :V  if  ft  m-cre  observed,  statisticians 
would  reject  the  random  trie  circa  hype  thesis. 

A really  sophlrtkafed  jrovp  of  Iwey  corn  Missions  r*  beat  on  dberininatton 
wou'd  have  to  select  groups  of  juries  sriich  ver*  fr-ithe.-  too  dose  nor  too  fir 
frost  the  ttpetltd.  Against  « :b  educated  mi’i.-oltnce  tie  chi-square  test  o?ers 
to  defrtse. 
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This  is  the  probability  of  the  occurrence  of  the  observed  dis- 
tribution out  of  all  possible  distributions  of  Negroes  on  juries. 
The  total  probability  of  selecting  seventeen  Negroes  regardless 
of  distribution  is: 


There  is  thus  approximately  one  chance  in  ten  that  seventeen 
Negroes  would  be  selected.  Dividing  the  two  probabilities  yields: 
c.000143 

: — - 0.0014 

0.104 

The  conclusion  is  that,  on  the  assumption  most  favorable  to 
the  state,  there  Is  approximately  one  chance  out  of  a thousand 
th?t  a distribution  of  seventeen  Negroes  on  twelve  grand  juries 
would  show  a variation  as  small  as  that  observed.  Since  this  is 
much  smaller  than  the  critical  value  0.05  we  are  led  to  reject  the 
Bcrnoultian  hypothesis.*’ 

The  significance  of  this  rejection  must  now  be  considered. 
Rejection  of  the  Bernoullian  model  of  the  selection  process  can 
be  treated  as  Implying  the  existence  of  discrimination  only  if  this 
model  would  accurately  reflect  the  racial  results  jf  jury  selection 

•*  tft  applying  1 version  of  the  **  method  to  the  fads  in  Bardsdcfe  we  sought 
to  measure  the  probability  that  the  number  of  Negroes  on  crand  juries  would  tefloct 
almost  undcviatingly  their  proportion  in  the  population  of -eligible*.  An  alternative 
test  of  the  same  blue  could  be  constructed  by  determining  the  probability  under  a 
random  Klrtthi  system  that  no  juries  trocM  hive  had  fewer  Ne;roej  and  on 
Juries  would  hive  had  a greater  number  of  Negroes.  Th:  probability  that  no  Jury 
would  have  more  than  two  Negroes  and  at  Inst  or.e  would  hive  fewer  luan  two 
b P(%  £^1)*,  that  ts  the  probability  lh?t  all  rite  juries  would  havt-  had  two  or 
fewer  Negroes  less  P{v  = i)*,  th?  probability  that  all  nine  juries  would  have  two. 
Upea  the  assumption  that  P « this  probability  is  approximately  0.04$. 
There  are  thus  slightly  lew  than  five  chancel  in  one  hundred  that  a greater  number 
of  Negroes  would  not  have  appeared  on  some  judfi  if  the  selection  were  random. 

The  other  half  of  this  proposition  is  that  there  should  have  been  some  |unes 
wiih  fewer  Negroes  than  were  observed.  The  probability  that  every  Jury  would 
have  at  kast  on*  Negro  and  that  it  least  el;M  would  have  two  or  more  b 
e t)(t  - P(v  ^ t))\  or  the  probability  on:  jury  having  a singtte  Negro 
and  the  remainder  having  two  or  mere,  p'us  (i  - P(v  ^ 0)\  or  the  pcobabFity- 
Of  all  juries  hivirg  two  0?  mere  Negroes.  Upon  the  same  assumption  concemirtg  P 
this  probability  is  approximately  oesj.  There  are  thus  slightly  less  thin  five 
chances  In  one  hundred  that  1 smaller  number  of  N?;n-:s  wojW  not  have  appeared 
on  fotre  juries  if  the  selection  were  rtn  !om. 

The  probability  of  the  joint  ettutrenee  of  be:h  events  Is  the  product  of  the 
probehtiitki  of  each.  This  ts  ooiAXocsj  - Thus  in  only  two  chances 

out  of  a thousand  would  the  number  of  N\gro  jurors  adhere  so  closely  to  0>«lr 
proper1  b th t eUglbl:  pepuhtioft.  This  Is  simlkr  to  the  t:iu!t  obtained  by  the 
telhod  used  In  the  text. 
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In  a system  free  from  discrimination.  It  is  important  to  observe 
that  whether  this  is  so  is  not  a logical  but  an  empirical  question. 
It  can  be  resolved  only  by  charting  the  results  of  a system  which 
is  free  from  discrimination  but  which  otherwise  operates  in  pre- 
cisely the  same  way  as  the  system  under  experiment.  But  al- 
though scientific  verification  is  lacking,  the  assumption  we  have 
used  seems  plausible. 

If  the  observations  arc  of  venires,  then  to  reject  the  Bernoul- 
lian  hypothesis  implies  that  P must  vary  from  venire  to  venire, 
or  from  the  selection  of  one  venireman  to  the  selection  of  the  next. 
Since,  within  each  system  now  in  use,  the  method  of  selection  is 
supposed  to  remain  the  same  for  each  venire  and  venireman,  the 
only  reason  for  variation  would  seem  to  be  intentional  interven- 
tion: when  the  officials  charged  with  selecting  veniremen  choose 
to  exclude  Negroes,  P = o and  Q = t.  It  is  this  reasoning  that 
makes  plausible  the  assumption  that  an  honest,  selection  process 
consists  of  a series  of  Bernoulli  trials  and  that  a non-Bernoullian 
process  is  discriminatory. 

Does  this  also  apply  to  the  grand  jury?  In  Barksdcic  it  was 
claimed  by  the  state  that  the  relative  absence  of  Negroes  from 
grand  juries  could  be  explained,  inter  alia,  by  their  more  fre- 
quent requests  to  be  excused  because  of  economic  hardship. 
Assuming  such  requests  occurred,  can  it  be  said  that  selecting 
grand  jurors  is,  in  the  absence  of  discrimination,  a scries  of  Ber- 
noulli trials?  There  appears  no  reason  to  treat  the  selection  of 
grand  jurors  differently  from  the  selection  of  veniremen.  The 
existence  of  an  excuse  based  on  economic  grounds  might  affect 
the  number  of  available  Negroes,  but  within  the  pool  of  eligible* 
the  selection  of  grand  jurors  must  still  be  made  without  consider- 
ation of  race.  Poverty  can  explain  why  only  one-sixth  as  many 
Negroes  as  whites  are  left  in  the  pool,  but  not  why  precisely  one- 
sixth  of  almost  every  grand  jury  selected  is  Negro.  Since  eco- 
nomic hardship  ar.d  illiteracy  operate  in  similar  ways  as  barriers 
to  jury  service  it  appears  reasonable  to  extend  the  assumption 
that  the  selection  of  venires  is  Bernoullian  to  the  grand  jury. 

Whether  this  assumption  may  Is  extended  to  tire  petit  jury 
is  much  more  doubtful.  If  it  is  constitutionally  permissible  for 
the  prosecutor  to  strike  any  Negro  from  the  jury  in  a capilfl 
case  where  the  defendant  is  a Negro,  or  to  follow  similar  racial 
tules,  the  selection  process  for  the  petit  jury  is  not  Bernoullian 
because  on  the  occasions  when  Negroes  are  excluded  P « o.  On 
the  other  hand,  if  exclusionary  rules  are  not  constitutionally  per- 
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missible  and  each  juror  must  be  considered  on  his  merits,  the 
case  for  the  Bernoutlinn  hypothesis  is  stronger  since  the  deter- 
mination to  strike  a juror  must  be  more  particularized  and  less 
dependent  on  race.  But  even  if  this  is  the  law,  it  is  still  possible 
that  P will  be  smaller  in  any'  case  where  the  prosecutor  might 
reasonably  expect  a greater  possibl'ity  of  bias  on  the  part  of  a 
Negro  juror.  If  that  is  so,  the  selection  of  petit  jurors  for  a series 
of  juries  would  not  consist  of  Bernoulli  trials.  In  short,  it  seems 
fair  to  extend  the  logic  of  the  Court’s  decisions  in  the  venire 
cases  to  grand  juries,  but  it  is  doubtful  that  the  same  methods 
can  be  applied  to  petit  juries. 

Barksdale  involved  grand  juries,  so  the  Bernoullhn  hypothesis 
should  apply.  Vet  we  have  seen  that,  even  assuming  that  the 
proportion  of  Negroes  on  grand  juries  in  Orleans  Parish  was  the 
same  as  their  proportion  in  the  population  of  eligible?,  the  fact 
that  it  was  almost,  unvaryingly  so  makes  the  distribution  incon- 
sistent with  the  hypothesis  of  random  selection.  Since  on  the 
assumption  most  favorable  tn  the  slate  the  distribution  test  leads 
to  8 rejection  of  the  hypothesis  of  randor.  selection,  we  must 
conclude  that  no  argument  based  on  the  presence  or  absence  of 
qualifications  in  the  Negro  community  will  explain  the  observed 
distribution.  It  is  overwhelmingly  unlikely  that  the  Barksdale 
facts  coutd  have  resulted  from  the  evenhanded  use  of  selection 
criteria  unfavorable  to  Negroes  or  from  sloppy,  haphazard  ad- 
ministration, Within  the  confidence  limits  of  the  test,  the  con- 
clusion seems  inescapable  that  race  was  considered  in  selecting 
grand  jurors. 


IV 

“The  Civil  War  Amendments,"  Mr.  Justice  Frankfurter  wrote, 
concurring  in  Cassell  v.  Texas,  “did  not  turn  matters  that  are 
inherently  incommensurable  into  mere  matters  of  arithmetic."  M 
To  the  reader  who  has  faithfully  threaded  his  way  through  this 
article  it  should  be  apparent  that  the  mathematical  methods  de- 
veloped here  do  not  supplant  the  necessity  for  legal  judgments. 
The  purpose  E”d  effect  of  the  mathematical  methods  is  to  draw 
Implications  front  the  data  on  which  the  legal  judgments  are 
founded.  If  the  consequence  of  such  analysis  is  a legal  decision 
that  discrimination  exists  where  uninformed  intuition  would 
have  reached  a different  conclusion,  the  decision  vas  compelled, 
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not  by  mathematics,  but  by  the  consistent  application  of  legal 
principles  to  new  information. 

A basic  legal  principle  in  the  jury  discrimination  cases  is  that 
the  selection  of  an  improbably  small  number  of  Negroes  is  evi- 
dence of  discrimination.  This  principle,  which  links  a finding  of 
discriminaton  to  a determination  of  probabilities,  opens  the 
door  to  the  use  of  statistical  analysis  in  these  cases.  The  math- 
ematical methods  described  here  have  been  used  to  calculate 
the  probabilities  which  the  law  has  established  as  relevant  for 
determining  the  existence  of  discrimination. 

The  second  legal  principle  controlling  these  cases  is  that  a 
disparity  between  the  proportion  of  Negroes  on  venires  and  in 
the  population  generally  is  evidence  of  the  improbability  of  ran- 
dom selection.  Using  this  assumption  we  have  constructed  a 
statistical  test  for  randomness  and  applied  it  to  the  number  of 
Negroes  appearing  on  the  jury  venires  described  by  the  record 
In  Swain  t>.  Alabama.  The  application  of  the  test  led  to  the  con- 
clusion that  the  selection  process  was  not  random  with  respect  to 
race  if  the  entire  aJult  male  population  oi  beth  races  was  con- 
sidered eligible.  The  results  could  not  be  justified  by  any  lack  of 
qualifications  for  jury  service  among  Negroes  unless  it  was 
assumed  that  the  rate  of  Negro  qualification  with  respect  to  such 
subjective  faclo's  as  lack  of  integrity  was  little  more  than  half 
the  rate  of  white  qualification. 

The  failure  to  select  a greater  number  of  Negroes  for  venires 
Is  not  the  only  fact  relevant  to  a claim  of  discrimination.  A 
skewed  distribution  may  also  constitute  such  evidence.  In  order 
to  lest  for  skewness  we  used  a goodness-of-fit  lest  known  as  the 
chi-square  test  to  analyse  the  distribution  of  Negroes  on  grand 
Juries  in  Barksdale  t>.  Louisiana  and  concluded  that  race  was 
considered  in  selecting  grand  jurors  in  Orleans  Parish. 

The  particular  statistical  tests  which  produced  these  results 
are  not  by  any  means  the  only  possible  tests  of  these  matters. 
Other  tests  with  different  characteristics  a*o  possible  and  have 
been  used  in  treating  cognate  problems.  Unfortunately,  an 
adequate  discussion  of  the  criteria  for  chocsing  tests  would  lead 
us  into  the  vasty  deep,  and  the  reader  may  fed  that  the  waters 
here  have  been  deep  enough  already.  It  must  suffice  to  say  that 
the  choice  of  tests  depends  on  the  i.atute  of  the  suspected  dis- 
crimination and  consequently  presents  a delicately  mixed  problem 
of  mathematics,  fact,  and  law. 

tt  may  be  felt,  however,  that  all  mathematical  tests  are  inap- 
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propriate  for  jury  discrimination  cases  because  such  methods 
arc  too  technical,  elaborate,  and  fine-grained.  Perhaps  judicial 
intuition  is  sufficient  to  decide  what  the  Constitution  requires 
and  only  the  intuitively  obvious  cases  should  be  deemed  appro- 
priate for  judicial  correction.  This  feeling  may  be  supported  by 
the  argument  that  the  Court  has  never  required  jury  selection  to 
be  mathematically  perfect  in  the  sense  that  every  potential  venire- 
man be  canvassed  for  service;  consequently,  it  may  be  said  that 
the  proposed  mathematical  technique  is  misleading  because  it 
applies  exact  methods  to  an  inexact  system.  This  charge  is  worth 
considering  because  it  can  be  leveled  at  virtually  every  attempt 
to  apply  methods  Irom  the  exact  sciences  to  legal  problems:  in 
each  case  the  legal  rules  will  invariably  reflect  the  human  imper- 
fections of  our  institutions,  but  the  methods  used  to  analyze  them 
will  not,  seemingly,  allow  for  such  imperfections. 

In  the  present  context  this  does  not  appear  to  be  a significant 
problem.  Any  statistical  method  of  analyzing  data  tells  us  the 
attributes  of  the  system  under  examination  but  does  not  tell  us 
the  causes  of  those  attributes,  though  it  may  be  able  to  rule  out 
some  conceivable  causes.  Thus  in  Swain  we  found  that  in  order 
to  sustain  the  observations  it  must  be  assumed  that  the  rate  of 
qualification  of  Negroes  for  jury  service  was  substantially  smaller 
than  the  qualification  rate  for  whites.  We  assumed  that  this  oc- 
curred because  the  commissioners  disqualified  a proportionately 
greater  number  of  Xegroes,  but  it  may  be  that  the  commissioners 
only  Innocently  canvassed  proportionately  fewer  literate  Negroes 
than  literate  whites. 

In  this  and  other  cases,  however,  the  cause  of  the  result  may  be 
legally  irrelevant.  The  Supreme  Court  has  held  that  an  Innocent 
failure  to  canvass  any  Negroes  ot  all  was  not  an  acceptable  excuse 
for  a result  which  was,  in  effect,  the  equivalent  of  discriminatory 
selection.11  In  at  least  some  cases,  then,  a system  whose  results 
appear  to  be  discriminatory  may  be  treated  as  such  even  though 
the  commissioners  convincingly  profess  good  faith.  Statistical 
techniques  can  be  used  to  identify  such  cases.  Even  when  a case 
is  not  extreme,  so  that  the  courts  are  willing  to  hear  a defense 
of  good  faith,  mathematical  methods  may  be  able,  as  In  the 
Barksdale  case,  to  provide  a test  of  that  defense. 

The  acceptance  of  mathematical  methods  undoubtedly  implies 
a certain  yielding  by  judges  of  their  freedom  of  decision.  In 
Swain  the  mathematical  case  appears  overwhelming,  and  it  leads 
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to  a conclusion  contrary  to  that  reached  by  the  Court.  This 
consequence  of  mathematical  methods  may  be  a source  of  judicial 
reluctance  to  entrust  decisions  to  them  and  may  tempt  the  Court 
10  continue  the  familiar  practice  of  casting  legal  rules  in  broad 
discretionary  terms  to  ensure  that  legal  principles  will  not  em- 
barrass with  results  that  run  against  the  grain  of  judicial  intuition. 

In  the  developing  stages  of  a legal  rule,  a reservation  of  discre- 
tion may  be  the  better  part  of  judicial  valor,  but  where  principles 
have  become  established  over  a long  period  the  desire  to  reserve 
an  unmeasured  power  over  their  consequences  is  not  an  appealing 
motive  for  rejecting  objective  methods.  In  the'sensitivc  area  of 
the  lav/  we  have  been  considering,  where  cases  carry  the  freight 
of  broad  social  conflicts,  the  authority  of  decisions  will  not  be 
enhanced  by  methods  which  depend  unnecessarily  on  personal 
and  intuitive  judgments.  As  beamed  Hand  wrote,  the  author- 
ity and  immunity  of  a judge  rest  “upon  the  assumption  that  he 
speaks  with  the  mouth  of  others."  **  The  novelty  of  the  methods 
proposed  in  this  article  may  make  them  seem  strange  to  the 
lawyer  or  the  judge,  but  the  analysis  provided  by  statistical 
theory  should,  in  the  long  run,  make  the  decisions  in  the  jury 
discrimination  cases  a more  natural  and  inevitable  consequence 
of  the  data  which  determine  them. 

F.very  successful  use  of  technical  information  by  the  law  has 
had  to  travel  the  path  from  strangeness  to  tr.di-pensabilily,  The 
importance  of  objectivity  and  consistency  should  persuade  us  to 
follow  legal  principles  in  these  cases  even  when  their  logic  leads 
to  ground  made  less  certain  by  the  absence  of  the  assurance  — 
and  it  is  sometimes  a false  assurance  — which  intuition  brings 
to  judgment. 

••Hard,  Mr.  Juitkt  **  H**v.  L.  Ktv.  (19J9). 


CHAPTER  FIVE 


CLASS  NOTES 

An  exercise  for  applying  the  ideas  about  statistical  inference  contained 
in  the  readings  is  fairly  clearcut.  We  can  use  the  data  generated  by  the 
previous  exercise  concerned  with  sampling,  based  on  our  hyptheilcal 
town. 

We  will  assume  that  we  have  drawn  a random  sample  of  40  households 
and  that  we  have  calculated  the  percentage  of  households  whose  head  is 
Negro. 

What  estimate  can  we  make  of  the  percentage  of  Negro  households  in 
the  population  from  which  the  sample  is  drawn? 

What  estimate  can  we  make  of  the  average  annual  income  in  the 
population,  both  white  and  Negro,  based  on  the  information  contained  in 
our  sami  !e? 

As  the  readings  indicate,  statistical  inference  involves  not  only 
estimates  of  population  means  and  percentages  t it  also  Involves  the  testing 
of  null  hypotheses  about  the  relationship  between  characteristics.  For 
example: 

We  can  select  a random  sample  of  100  cases.  We  can  then  divide  the 
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sample  approximately  in  half,  with  regard  to  education,  dichotomizing  our 
households  into  "high"  and  "low.  " These  two  groups  can  each  be  considered 
a random  sample  of  "high"  and  "low"  educational  groups  in  the  population. 

We  can  then  calculate  the  average  annual  income  of  each  of  the  two 
groups.  The  problem,  then,  is  this:  We  can  observe  a difference  in  our 
samples  in  average  income  between  those  categorized  as  "high"  and  "low’* 
with  respect  to  education.  Does  this  difference  between  the  two  sample 
groups  represent  a difference  that  would  be  found  in  the  population  from 
which  they  are  drawn  or  is  it  a result  of  sampling  vagaries?  The  solution 
to  the  problem  can  be  found  in  the  selection  from  Mueller  and  Schuessler. 
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CHAPTER  SIX 


MEASURES  OF  ASSOCIATION 
I 

The  idea  of  correlation  or  association  lies  at  the  heart  of  all 
scientific  research.  Without  a thorough  understanding  of  that  idea,  it 
is  Impossible  to  assess  the  utility  or  the  validity  of  experimental  findings, 
survey  design,  and  so  on. 

As  John  Mueller  and  Karl  Schuessler  point  out  in  the  first  selection 
of  this  chapter,  the  attempt  to  understand  the  Idea  of  association  involves 
three  objectives.  First,  we  wish  to  identify  the  variables  which  are 
linked  to  the  phenomenon  that  interests  us.  Second,  we  wish  to  uncover 
the  pattern  of  that  linkage.  And  third,  we  wish  to  measure  the  strength  of 
that  linkage.  In  a sense,  of  course,  we  arc  all  informal  statisticians,  as 
these  authors  indicate,  and  we  are  always  using  the  Idea  of  association  or 
correlation  as  we  try  to  make  sense  out  of  our  world.  But  if  we  are  to  be 
at  all  scientific  or  rigorous,  we  must  go  beyond  this  informal  sort  of 
thinking  and  tie  down  our  concepts  more  precisely. 
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The  first  type  of  association  discussed  is  that  existing  between  two 
attributes.  In  an  earlier  chapter  we  pointed  out  that  an  attribute  is  a 
characteristic  which  is  either  present  or  absent — like  a light  bulb,  it  is 
either  on  or  off  and  its  presence  is  not  a matter  of  degree;  and  we  can  use 
the  symbols  discussed  by  Morris  Cohen  and  Ernest  Nagel,  in  a later 
selection  in  this  chapter,  and  speak  of  a characteristic  which  is  either 
present  (X)  or  absent  (X), 

When  we  ask,  then,  if  X is  related  to  Y,  we  can  set  fort  our  finding 
in  what  is  called  a 2 x 2 table:  one  row  indicates  when  Y is  present,  another 
row  indicates  when  Y is  absent;  one  column  indicates  when  X is  present, 
another  column  indicates  when  X is  absent,  (See  Table  1. ) 

TABLE  I 


X 

(X  Present) 

X 

(X  Absent) 

Y (Y  Present) 

Y (Y  Absent 

Each  cell  in  the  table  gives  the  frequency  of  the  Joint  occurrence  of 
the  four  possibilities:  XY,  XY,  XY,  and  XY.  (In  the  readings  we  have  included 
a discussion  of  onlj  one  measure  of  association,  namely  Yule's  coefficient  of 


association  or  Q,  for  we  think  it  provides  most  of  the  essential  logical  steps 
when  we  decide  whether  two  characteristics  are  related  to  one  another.) 

hi  the  next  part  of  the  selection  by  Mueller  and  Schuessler,  the 
authors  turn  io  the  concept  of  correlation  between  variables.  It  is  Important 
for  the  student  to  realize  that  the  underlying  logic  is  very  much  the  same  ar. 
that  used  in  analysing  the  association  between  characteristics,  although 
there  are  the  added  complexities  of  dealing  with  variables  rather  than 
attributes.  Thi3  can  be  seen  most  easily  by  examining  Figure  10.2,3  in 
the  selected  readings.  If,  in  this  illustration,  we  consider  -50  to  0 on 
the  X axis  as  a situation  where  X is  absent,  and  0 to  50  as  a situation  where 
X is  present;  and  if  we  consider  0 to  60  on  the  Y axis  as  a situation  where 
Y is  absent,  and  50  to  100  a situation  where  Y is  present;  It  Is  then 
possible  to  convert  the  scatter  diagram  of  Figure  10. 2. 3 Into  a 2 x 2 table. 
We  have,  in  effect,  treated  variables  as  if  they  were  attributes  by  dichoto- 
mizing them--a  very  common  device  discussed  by  Mueller  and  Schuessler. 

But  all  of  this  should  be  clear  to  the  student  in  the  following  selec- 
tion which  Is  exceptionally  lucid.  The  reading  requires  very  little  in  the 
way  of  mathematical  background  and  yet  if  the  student  grasp3  the  essential 
points  he  will  have  grasped  one  of  the  most  fundamental  concepts  in 
scientific  inquity. 
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Now  it  is  obvious  that  if  one  factor  is  found  to  be  correlated  or 
associated  with  another  it  is  no  proof  that  one  causes  another.  Yet  there 
is  an  important  logical  connection  between  correlation  and  causation  which 
must  be  examined.  A search  for  causes  is  central  to  much  of  the  research 
in  the  social  sciences  and  we  cannot  avoid  the  difficulties  that  are  involved. 

The  law,  of  course,  has  also  long  had  a concern  with  causation, 
perhaps  most  notably  in  the  field  of  torts.  The  view  of  the  law,  however, 
is  rather  different  from  that  of  science.  In  fact,  according  to  Harper  and 
James  in  the  first  selection,  when  the  law  turns  its  attention  to  the  problem 
of  causation  there  is  "a  widely  recognized  confusion.  " In  their  discussion 
the  authors  claim  that  their  presentation  is  not  a metaphysical  one  but  "an 
ordinary,  matter-of-fact  inquiry  into  the  existence  or  non-existence  of  a 
causal  relation  as  laymen  would  view  it,  and  apparently  they  hope  to  avoid 
the  ambiguities  that  plague  the  subject.  But  they  too,  we  think,  are  in- 
evitably caught  up  in  the  horrendous  difficulties  of  trying  to  equate  human 
responsibility  and  causation  in  the  complex,  multi-faceted  flow  of  events. 

Professors  Hart  and  Honor e,  in  the  selection  of  their  book, 
Causation  in  the  La w , quite  wisely  point  out  that  the  law's  concept 
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of  causality  is  really  a metaphor  which  may  obscure  empirical  reality,  as 
well  as  reveal  it.  And  as  long  as  the  law  continues  to  pursue  its  chosen 
task  of  singling  out  the  voluntary  acts  of  individuals  as  the  cause  of 
particular  events,  it  seems  very  likely  that  the  law  will  continue  to  be  at 
odds  with  the  scientific  viewpoint. 

The  reading  from  the  book  by  Professors  Cohen  ane  Nagel,  Logic 
and  the  Sci e nti f i c Method,  "summarizes  much  of  the  current  thinking 
about  causation  in  a variety  of  scientific  fields.  They  point  out  that  "common 
sense"  notions  of  cause  and  effect  are  heavily  dependent  on  a model  of  human 
behavior  in  which  individuals  deliberately  intervene  in  the  flow  of  events  to 
effect  a certain  purpose — and  the  model  is  not  very  good  when  applied  to 
the  non-human  world  and  is  of  doubtful  v/orth  in  the  world  of  human  behavior 
as  well.  In  fact,  they  argue  that  "In  the  light  of  the  great  variety  in  the 
kinds  of  specific  orders  and  factors  which  may  be  the  objects  of  an  inquiry, 

■'  may  seem  preposterous  to  believe  that  any  general  rules  can  be  stated 
wmch  will  enable  us  to  find  satisfactory  answers  to  all  possible  problems.  " 
The  search  for  causes  thus  seems  to  have  an  indeterminate  and  uncertain 
quality — which  may  be  disturbing  for  the  law  in  its  desire  for  certainty  and 
personal  accountability  but  which  is  familiar  enough  in  science. 

The  fact  that  it  is  so  difficult  to  establish  cause  and  effect  relation- 
ships and  that  the  concept  itself  is  variable  in  its  meaning  has  led  some 
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philosophers  of  science  to  urge  that  we  abandon  the  idea  of  causation  altogether. 
We  would  simply  deal  with  correlations,  recognizing  that  correlation  and 
causation  are  not  the  same  thing.  It  is  doubtful  if  many  scientists  would  be 
willing  to  go  so  far,  but  the  problems  In  dealing  with  the  idea  of  causation 
do  impose  a fair  amount  of  caution  in  scientific  inquiry. 

First,  scientists  accept  the  fact  that  there  are  generally  many  factors 
which  contribute  to  a particular  effect  and  no  one  cause  can  be  singled  out  as 
the  cause. 

Second,  cause  and  effect  represent  a statistical  probability  of  the 
joint  occurrence  of  two  or  more  events  rather  than  an  invariant  relationship. 

Third,  there  are  many  different  levels  of  discourse  and  many 
different  ways  of  organizing  data.  What  is  examined  as  a cause  in  one  in- 
stance of  scientific  inquiry  may  be  largely  ignored  in  another. 

Fourth,  in  the  examination  of  social  behavior,  experimentation  is 
difficult  or  simply  ruled  out  on  ethical  grounds.  As  a consequence,  we  are 
seldom  able  to  isolate  the  influence  of  one  variable  and  our  assertions  about 
causation  must  be  patched  together  with  many  untested  assumptions. 

In  short,  the  evidence  that  we  can  accumulate  about  cause  and  effect 
in  social  behavior  is  weak  and  uncertain.  But  it  is  far  better  than  no  evidence 
at  all. 
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EXPLAINED  VARIATION 


In  the  following  selection  by  Mueller  and  Schuessler,  the 
authors  discuss  the  measurement  of  relationships  both  between 
attributes  and  between  variables.  (They  speak  of  the  principle  of 
joint  occurrence  and  the  principle  of  covariation. ) 

It  is  the  latter — the  relationship  between  variables — which 
is  probably  most  important  for  the  student  to  understand  at  this 
point,  since  he  is  sure  to  encounter  the  concept  of  correlation 
coefficients  if  he  does  very  much  reading  in  the  social  sciences. 

One  of  the  most  useful  ways  to  view  a correlation  co- 
efficient— which  measures  the  degree  of  relationship  between  two 
variables — is  to  see  it  as  expressing  the  proportion  of  variation 
in  one  variable  which  is  "explained"  or  accounted  for  by  variation 
in  another  variable.  It  is  this  view  of  correlation  which  is  stressed 
by  Mueller  and  Schuessler  and  it  is  worth  close  attention. 


From  John  H.  Mueller  and  Karl  F.  Schuessler,  Statistical 
Reasoning  in  Sociology  (Boston:  Houghton  Mifflin 
Company,  1961)  pp.  238-249;  281-310.  Reprinted  by 
permission. 
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John  H*  Mueller  and  Karl  F*  Schuessler, 
Statistical  Reasoning,  pp  238-249;  281-310. 


Section  Ome 

Concept  of  Statistical  Association 


Principle  of  Contingency . It  is  an  axiom  of  science,  as  well  as  of 
common  sense,  that  no  event  in  nature  “just  happens,"  but  always  occurs 
under  very  specific,  known  cr  unknown,  circumstances.  An  event  is 
therefore  never  to  be  viewed  in  isolation.  It  must  be  considered  as  a 
product  of  the  joint  operation  of  many  forces,  each  of  which  contributes 
a variable  element  to  the  observed  outcome.  Thus,  the  size  of  family 
may  be  dependent  on  such  factors  as  age  at  marriage,  level  of  income, 
extent  of  employment  of  the  mother  outside  the  home,  and  the  religious 
ideology  entertainr1  the  parents.  Parole  success  of  released  prisoners 
may  be  related  to  the  type  of  crime  committed,  age,  and  history  of  re- 
cidivism. Some  of  these  factors  tend  to  promote  and  accelerate,  others 
to  retard  in  varying  degrees  or  even  inhibit,  the  occurrence  of  the  event, 
or  at  least  to  modify  its  character  or  magnitude.  Thus,  the  religious 
factor  may  foster  large  families,  while  the  economic  factor  may  restrain 
that  tendency.  In  any  case,  we  cannot  predict  the  degree  of  parole  suc- 
cess, or  explain  the  size  of  the  family,  unconditionally,  but  only  ott  the 
basis  of  specifically  designated  factors,  or  variables,  on  which  the  out- 
come is  contingent.  The  human  observer  docs  not  possess  absolute 
knowledge;  he  must  use  one  event  as  a cue  to  anticipate  another.  His 
understanding  is  therefore  grounded  in  the  principle  of  contingency.  And 
it  is  in  accordance  with  that  principle  that  the  sciences,  both  physical 
and  social,  have  set  up  their  methods  arid  objectives:  (1)  to  identify  the 
variables  (determining  factors)  associated  with  an  event;  (2)  to  uncover 
the  patterns  of  association  between  the  variables  and  the  event;  and 
(3)  to  measure  the  strength  of  that  association. 
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Although  no  event  is  conceivable  in  isolation,  It  is  not  a simple  matter 
to  detect  with  what  variables  it  is  linked.  For  example,  the  association 
between  delinquency  and  intelligence  was  once  thought  to  be  very  close, 
but  in  later  analysis  was  found  to  be  quite  remote.  This  shift  in  inter- 
pretation was  a result  not  only  of  more  carefully  controlled  studies  of 
delinquency  and  its  associated  social  factors,  but  also  of  a redefinition 
of  the  test-intelligence  norms  themselves. 

To  man,  nature  is  infinitely  complex:  hence,  the  search  for  such  under- 
lying relations  can  never  be  completed.  There  are  innumerable  factors 
whose  relations  are  so  intricate  that  (he  “final”  factor  in  the  production 
of  on  event  can  never  be  attained,  much  less  measured,  However,  in 
its  unfulfilled  quest  for  certainty,  our  common  senso  begins  very  early 
to  construct  patterns  of  relationships  on  the  basis  of  which  it  endeavors 
to  comprehend  the  past,  understand  the  present,  and  thereby  anticipate 
the  future. 

Patterns  of  Relationship.  The  patterns  of  association  are,  of  course, 
continuously  revised  in  the  trial  and  error  of  daily  experiej.ee.  In  the 
process,  the  observer  mentally  quantifies  and  summarizes  his  observa- 
tions by  first  noting  the  factors  which  seem  to  him  to  “cause”  or  to  be 
linked  with  the  event  and,  second,  by  noting  the  frequency  with  which 
lie  successfully  anticipates  or  forecasts  it,  In  this  casual  manner,  every 
person  begins  to  cultivate  the  habits  of  association  and  becomes  an  in- 
formal statistician.  He  practices  intuitively  the  principle  of  correlation. 
In  fact,  much  sociology  effectively  employs  such  intuitive  statistics, 
skillfully  put  together  by  alert,  widely -traveled  scholars,  made  without 
benefit  of  pencil  and  pnper  calculation  or  technical  procedures.  Indeed, 
many  who  ave  critical  of  the  utility  of  statistical  procedures  nevertheless 
unwittingly  employ  them  in  this  unofficial  manner. 

For  some  purposes,  a rough  subjective  approximation  cf  a correlation 
Is  fairly  satisfactory,  but  for  scientific  purposes  move  accurate  measure- 
ments are  desired.  Such  precision  is  not  a simple  matter  to  achieve. 
The  difficulty  lies  in  the  complexity  cf  the  patterns  of  relationship  in 
terms  of  which  we  view  and  organize  the  world.  Some  of  the  salient 
features  of  this  complexity  in  patterns  may  be  formulated  as  follows: 
(1)  every  event  is  the  outcome  of  multiple  (actors;  (2)  the  .orce  of  these 
respective  factors  varies  in  intensity,  and  (3)  may  flow  in  one  or  more 
directions  in  producing  its  effects;  (4)  the  factors  arc  in  constant  inter- 
action, and  consequently  (5)  they  may  reinforce,  counteract,  and  cancel 
one  another.  We  illustrate  with  a popular  example:  the  halfback  in  a 
football  game,  *vhosc  scoring  power  tends  to  be  associated  positively  with 
running  ability,  is  either  obstructed  or  aided  in  actual  performance  by 
the  condition  of  the  field,  his  fatigue,  the  type  of  plays  called,  to  say 
nothing  of  the  varied  activities  of  the  other  twenty-one  players  on  the 
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field.  This  association  between  scoring  power  and  running  ability  may 
therefore  not  be  visible  to  the  fans  in  the  stadium.  Just  so  is  the  correla- 
tion between  income  and  size  of  family  beclouded  by  such  factors  as 
religion,  age  at  marriage,  occupation  of  breadwinner,  and  education. 
It  may  at  first  seem  a homeless  task  to  disentangle  these  networks  of 
relationships  and  to  subject  them  to  statistical  reasoning.  Nevertheless, 
in  a large  number  of  observations,  the  essentia!  relation  can  be  expected 
to  shine  through  the  haze  formed  by  the  multiplicity  of  factors. 

This  problem  is  probably  less  difficult  in  the  physical  than  in  the  social 
sciences.  The  physical  scientist,  by  means  of  available  laboratory  con- 
trols, is  to  a certain  extent  able  to  segregate  and  manipulate  his  elements 
and  to  replicate  his  careful  and  undisturbed  observations,  \wiOreas  the 
social  scientist  is  often  obliged  to  accept  data  which  are  like  unrefined 
ores  from  “nature  in  the  raw"  and  to  assemble  materials  from  widely 
dispersed  sources  and  a variety  of  settings.  He  is  therefore  compelled 
to  employ  statistical  controls,  since,  in  general,  laboratory  controls  are 
closed  to  him. 

Whai  Arc  the  Evidences  of  Relationship?  Ilow  can  we  be  sure  that  factors 
are  interconnected?  And,  having  discovered  a relationship,  how  may  we  de- 
termine the  degree  or  intensity  of  that  relation?  Broadly  speaking,  thcreare 
two  earmarks  of  such  linkage:  (1)  joint  occurrence  of  attributes,  and 
(2)  parallel  changes  in  two  or  more  series  of  quantitative  observations. 

The  relative  frequency  with  which  certain  attributes  happen  together 
ir,  probably  the  most  elementary  basis  of  lay  judgment  of  association. 
This  is  the  principle  of  joint  occurrence.  Statistical  variables,  like  human 
beiugs,  arc  usually  judged  “by  the  company  they  keep."  For  example, 
if  delinquency  is  more  often  found  in  boys  than  in  girls,  we  conclude  that 
delinquency  is  associated  with  “bovness.”  The  strength  of  this  associa- 
tion wall  vary  according  to  other  factors  such  as  the  boy’s  age,  the  type 
of  delinquency,  and  many  other  elements  in  the  pattern,  all  of  which  will 
render  the  statistical  application  of  an  apparently  simple  principle  quite 
complicated.  Hence,  it  need  hardly  be  reiterated  here  that  some  system 
of  tabulation  and  classification  is  necessary  as  an  aid  not  only  in  establish- 
ing an  association,  but  also  in  determining  iU  strength. 

Second,  if  in  two  series  of  quantitative  data,  a unit  change  in  one 
variable  is  paralleled  with  some  degree  of  regularity  by  a comparable 
change  in  the  other  series  — that  is,  if  they  move  together  — we  conclude 
that  they  arc  somehow  tied  together,  and  that  there  Is  an  association 
between  the  two  sets  of  data.  For  example,  as  income  declines,  the  size 
of  family  tends  to  increase;  and,  if  the  observations  endure  through  a 
rather  extensive  range  — that  is,  for  the  entire  range  of  families  of  all 
sizes  and  of  incomes  of  varying  amounts  — the  evidence  of  a relation  is 
strengthened.  This  is  called  the  principle  of  covariation. 
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Devices  for  the  Measurement  of  Association.  Techniques  for  the  meas- 
urement of  association  must  be  adapted  (1)  to  the  nature  of  tire  data, 
(2)  to  the  number  of  variables  being  related,  and  (3)  to  the  patterns  of 
relationships  among  them,  lienee,  the  devices  will  differ  according  to 
whether  the  data  are  in  the  form  of  attributes  which  are  merely  enu- 
merated, or  measures  which  are  quantitatively  scaled;  and,  additionally, 
whether  the  pattern  of  relationship  between  the  variables  is  simple  or 
complex. 

Of  the  numerous  devices  available,  we  will  present  here  six  types, 
selected  for  (a)  simplicity  of  calculation,  (b)  the  prevalence  of  their  use, 
and  (c)  their  applicability  to  the  data  that  me  customarily  encountered 
by  sociologists.  The  different  specimens  illustrate  a variety  of  approaches 
to  the  measurement  of  association  and  should  above  all  convince  the 
student  of  the  relativity  of  their  appropriateness. 

The  f-j.~iUlas,  which  are  in  some  respects  interrelated,  may  be  con- 
veniently grouped  on  the  basis  of  the  two  principles  of  association  al- 
ready mentioned:  the  principle  of  joint  occurrence  and  the  principle  of 
covariation.  Under  the  first  we  include: 

Yule’s  Coefficient  of  Association,  Q 
Phi  Coefficient,  0 
Coefficient  of  Contingency  C 

Under  the  second,  we  include: 

Rank-Difference  Method  of  Correlation,  p 
Product-Moment  Coefficient  of  Correlation,  r 
The  Correlation  Ratio,  r? 


Questions  and  Problems 

1.  Define  the  following  concepts: 

Principle  of  Contingency 
Pattern  of  Relationship 
Joint  Occurrence 
Covariation 

2.  Characterize  the  following  relationships  as  simple  or  complex  and  give  your 
reasoning: 

age  at  marriage  and  size  of  family 

income  and  social  status 

infant-fc?ding  and  personality 

capital  punishment  and  incidence  of  murder 

anomie  and  suicide 

test  and  retest  intelligence  scores 

diameter  of  circle  nnd  its  circumference 
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Section  Two 

Coefficient  of  Association:  Yule's  Q 

One  of  the  simplest  measures  of  association  is  known  as  the  Coefficient 
of  Association , or,  more  informally,  as  YiAAs  Q . This  measure  is  designed 
to  reflect  the  degree  of  association  between  a pair  of  qualitative  variables, 
arranged  in  a 2 X 2 (fourfold)  table.  In  his  study  of  cross-classification 
the  student  has  already  acquired  the  sense  of  inferring  association,  and 
even  cause-and-cffcct  relationships,  between  dichotomous  variables.  But 
at  that  juncture,  we  did  not  seek  to  compute  a single  over-all  measure 
which  would  reflect  the  strength  of  that  relation.  Yule's  Q and  other 
measures  of  association  are  designed  to  do  just  that,  since  they  are  sum- 
marizing measures  for  bivariate  data,  analogous  to  the  mean  for  univari- 
ate data. 

Inadequacy  of  the  2 X / Tabic  for  Estimate  of  Association . Many  persons 
untrained  in  quantitative  reasoning  naively  succumb  to  the  temptation 
of  drawing  conclusions  on  the  degree  of  association  from  a 2X1  table, 
instead  of  from  a 2X2  table,  which  is  the  minimum  for  that  purpose. 
The  deceptive  ease  with  which  such  erroneous  deductions  can  be  made 
is  illustrated  in  the  four  2X1  tables  depicted  in  Table  9.2.1. 


Tabic  9.2.1  2 X / Tables 


Boys  Per  Cent 

Delinquent  10% 

Non-delinquent  00 

100% 

RciwbUcctns  Per  Cent 
Isolationist  50% 

Internationalist  50 

100% 

stales  Per  Cent 

A Grades  ' 10% 

Other  90 

100% 

Fei»c.k  Drivers  Per  Cent 
AccMont  20% 

No  Accident  SO 

100% 

Source:  Hypothetical 


Since  only  10  boys  out  of  100  are  delinquent,  a layman  may  be  misled 
into  the  conclusion  that  there  is  a very  weak  association  between  boyness 
and  delinquency;  similarly,  that  women  are  not  accident  prone.  Since 
Republicans  divide  50-50  on  isolation,  there  would  seem  to  be  no  striking 
tendency  for  Republicans  to  favor  one  or  the  other  foreign  policy. 


I nV  nu * * ^ a «*** 
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But  the  important  point  is  that  delinquencies  are  distributed  between 
boys  and  girls,  and  that  auto  accidents  are  shared  by  both  men  and 
women  drivers.  We  must-  thcrofoic  inquire  in  what  proportion  the 
limited  supply  of  violations  or  auto  accidents  are  divided  between  the 
sexes.  To  which  sex  do  delinquencies  or  accidents  preferentially  attach 
themselves?  To  estimate  the  degree  of  affinity  for  either  sex,  we  must 
know  the  delinquency  rule  for  both  sexes;  a delinquency  rate  for  boys 
can  be  said  to  be  high  only  when  the  rate  for  the  girls  is  known.  Let 
us  therefore  provide  the  rates  for  females  and  Democrats  in  the  2X1 
tables,  and  analyze  the  results. 


ToWe9.2.2  2 X 2 Tables 


Delinquent 

Non-delinquent 

Boys 

10% 

00 

100% 

Girh 

0% 

100 

100% 

Isolationist 

Internationalist 

Rep. 

50  % 
50 

100% 

Dem. 

30% 

70 

100%, 

Mate 

Female 

Men 

Women 

A Grades 

10% 

20% 

Accident 

10% 

20% 

Other 

90 

SO 

No  Accident 

90 

100% 

100% 

100% 

100%, 

Source:  Hypothetical 


The  delinquency  rate  of  boys  now  turns  out  to  be  rather  high,  since 
even  a 10  per  cent  delinquency  rate  is  higher  than  no  delinquency  at  all. 
Similarly,  the  grades  of  college  men  are  below  the  norm;  the  Republi- 
cans display  a marked  propensity  toward  isolationism;  and  women  driv- 
el^ show  a marked  susceptibility  to  mishaps. 

The  fact  is  that  we  have  now  introduced  a standard  of  judgment  against 
which  the  2X1  table  may  be  compared.  Some  standard  Is  inevitably 
and  unwittingly  introduced  by  every  observer,  and  it  is  the  function  of 
statistical  procedures  to  make  the  standard  explicit.  Hence,  a 2X2 
table  is  the  minimum  for  a dependable  conclusion  on  association. 

The  2X2  Table.  A 2 X 1 table  merely  presents  the  two  subclasses  of 
a single  variable,  whereas  a 2 X 2 table  presents  the  subclasses  of  two 
vu'iables  cross-classified  by  one  another.  In  Table  9.2.3,  a case  is  clas- 
sified as  boy  and  delinquent,  r.s  girl  and  non-delinquent.  This  double 
classification  automatically  establishes  the  distribution  of  joint  occur- 
rences between  a given  sex  and  behavior  which  is,  of  course,  basic  to 
tire  understanding  of  the  phenomenon  of  association.  In  the  hypo- 
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Table  9.2.3 


Boy  Girl  Total 


Number  of  Delinquents  by  Sex 


Delinquent  20  0 20 


Non-delinquent  30  50  SO 


Total 


50  50  100 


thetical  group  of  100  children  shown  in  Table  0.2.3,  there  are  20  delin- 
quents, all  of  whom  are  boys.  Most  of  the  children  are,  of  course,  non- 
delinquent. But,  given  a delinquency,  there  is  perfect  prediction  — 
that  is,  complete  certainty  abo  «t  the  sex  of  the  delinquent.  Hence, 
taking  the  data  at  face  value,  we  may  say  that  delinquency  i3  completely 
explained  by  boyness,  since  there  is  obviously  no  element  of  girluess  re- 
quired for  its  occurrence.  The  completeness  of  this  explanation  should 
register  itself  in  any  index  that  may  be  contrived  for  that  purpose. 

In  order  to  provide  a single  meusiuo  of  association  for  such  a 2x2 
table,  the  English  statistician,  G.  Udney  Yule,  proposed  the  following 
coeiFicient  of  association,  which  he  labeled  UQ'*  in  honor  of  the  nine- 
teenth century  statistician,  QuJtelet: 


where  o,  6,  c,  a are  the  joint  frequencies  conventionally  arranged  in  the 
fourfold  table  in  the  following  manner: 


Thi3  index  of  unity  — a result  already  anticipated  is  the  obvious 
measure  of  the  complete  association  between  maleness  and  delinquency, 
since  all  delinquents  are  male,  and  all  females  are  necessarily  non-delin- 
quent. 

Sensitivity  of  Q.  Any  index  must  be  able  to  discriminate  to  some  extent 
between  slight  variations  in  the  data.  It  will  later  become  apparent 
that  each  of  ti  e various  measures  of  association  has  its  own  idiosyncra- 


ad  H-  be 


(9.2.1) 


a b 


c (1 


Applying  the  formula  to  Table  9.2.3 


o = (2Q  x 5°)  r (30j<  0) 
V (20  X 50)  + (30  X 0) 
_ 1000  - 0 
1000  + 0 
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ties  in  reflecting  gradations  of  association.  Yule’s  Q,  being  our  first 
example  of  a measure  of  association,  in  its  simplicity  supplies  us  with  4 
convenient  introduction  to  this  type  of  analysis  which  will,  in  its  turn, 
throiv  considerable  light  on  all  subsequent  instances.  How  sensitive, 
then,  is  Q to  the  variations  in  the  data? 

First  let  us  assume  that  not  all  the  delinquents  are  boys,  but  rather 
that  1 and  5 girls  in  50,  respectively,  arc  delinquents.  (See  a and  6 of 
Table  9.2.4.)  In  both  instances,  there  is  still  a positive  association  be- 

Table  9-2.4  Sensitivity  of  Q 


DeL 

Non-del. 


(a) 

(6) 

I 

3 

B 

G 

B 

G 

B 

G 

19 

1 

20 

lo 

5 

20 

10 

10 

20 

31 

49 

SO 

35 

45 

SO 

40 

40 

SO 

50 

50 

100 

50 

50 

100 

50 

50 

100 

Q 

» .94 

Q 

« .59 

< 

1 = 0 

) 

tween  delinquency  and  boyncss,  but  not  as  complete  as  in  the  first  table. 
Thus,  the  value  of  the  index  is  reduced;  it  reaches  zero  when  a delin- 
quent is  just  as  likely  to  be  a boy  as  a girl.  Sex  and  delinquency  are  then 
said  to  be  independent;  pure  chance  prevails.  Statistically  speaking, 
the  internal  cell  ratios  are  identical  with  the  corresponding  marginal 
ratios.  You  might  as  well  toss  a coin  for  your  prediction;  hence,  <2  = 0. 

Effect  of  Marginal  Ratios.  We  have  seen  how  Q reflects  the  changes  in 
row  (column)  ratios,  with  marginals  remaining  constant.  However,  Q 
is  not  at  ail  sensitive  to  changes  in  marginal  ratios,  so  long  as  the  corre- 
sponding cell  ratio  remains  fixed.  For  purposes  in  hand,  let  us  construe 
the  column  subtotals  as  samples  of  100  boys  and  50  girls  (Table  9.2.5). 


Table  9.2 .5  Stab  ility  of  Q 


Del. 

Non-del. 


(o) 


B G 


10 

3 

90 

47 

100  50 

Q = .27 


13 

137 

150 


B 

G 

10 

6 

16 

90 

04 

• 1S1 

100 

100 

200 

(c) 

h 

G 

20 

6 

26 

90 

47 

137 

110 

53 

163 

Q = .27 


Q - .27 
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One  might  intuitively  suppose  that  the  observed  association  of  Q «*  .27 
may  be  due  In  part  to  the  fact  that  there  are  twice  as  many  boys  as  girls 
in  the  total  sample  of  150,  and  that  by  doubling  the  sample  of  girls  the 
Index  might  Increase.  Let  us,  therefore,  douhle  the  number  of  girls, 
retaining,  however,  the  same  rate  of  delinquency  among  the  girls.  It 
will  bo  observed  that  the  value  of  Q remains  unchanged  under  this  infla- 
tion. Similarly,  if  wc  double  the  supply  of  delinquents  from  13  to  20 
£>y  multiplying  the  frequencies  of  that  row  bv  2,  the  value  of  Q again 
remains  unchanged. 

In  general,  therefore,  a change  in  the  relative  size  of  (ho  marginals  docs 
not  affect  the  value  of  Q,  so  long  as  the  ratios  within  either  columns  or 
rows  remain  undisturbed.  As  we  shall  sec  later,  this  docs  not  hold  for 
0,  another  index  cf  association  for  fourfold  tables.  In  this  respect  Q is 
more  stable  than  is  <£.  Whether  this  type  of  insensitivity  is  desirable  — 
and  should  therefore  be  considered  a virtue  of  stability  — or  whether  it 
is  a defect  of  the  formula,  must  be  left  to  the  good  judgment  of  the  worker 
in  terms  of  his  purposc3.  It  is  doubtless  important  to  know  that  the 
number  of  observations  in  a subclass  or  stratum  does  uot,  of  itself,  influ- 
ence Ihe  size  of  Q . 


Definition  of  Association:  Observed  one f dance  Fr*qucv.ci(S.  ' The  above 
illustrative  tables  yielded,  among  other  things,  the  irnnortant  gcner.dira* 
tion  that,  when  the  internal  cell  ratios  and  corresponding  marginal  ratios 
coincide,  and  there  is  no  association,  the  index  must  be  zero.  Only  when 
these  two  sets  of  ratios  differ,  and  to  thi  extent  of  that  difference,  is 
there  statistical  association.  Indeed,  the  formula  for  0. tests  on  this 
principle  of  t lie  discrepancy  between  the  observed  and  chance  cell  fre- 
quencies. 

The  chance  frequencies  are  easily  derivable  from  the  marginal  frequen- 
cies accotding  to  t lie  following  logic:  the  boys  constitute  50  per  cent  of 
the  childien;  hence,  if  the  two  sexes  arc  equally  susceptible  to  delin- 
quency (or,  in  other  word*,  If  the  sox  factor  hte  no  influence  on  the  pro- 
due  ;ion  of  delinquency  — i.e.,  if  sex  oi  1 delinquency  are  independent), 
the  boys  would  also  have  50  per  cc<U  of  the  delinquencies.  To  express 
thi<  in  still  another  way.  if  dcUnqiier.cN  were  divided  impartially  (by 
chance)  among  the  two  sexeq  the  boys'  quota  would  be  50  per  cent,  or 
in  terms  cf  our  example,  the  lays  would  have  10  of  the  20  delinquencies. 
Arithmetically  put: 


A0  v2f* 
100x?^ 


10 


In  essence,  we  simply  adjust  the  cell  ratio*  to  bo  identical  with  the  cor* 
responding  marginal  ra*N  (Tablo  9.2Aj-  The  boys  actually  show  5 more 
delinquency  than  would  be  exported  by  chance,  while  the  girts  show 


John  H.  Mueller  and  Karl  F.  Schuessler 


Table  9.2.6  Observed  and  Char.ce  Frcqnenciet 


Observed 

Eiptcltd 

B 

G 

B 

G 

Del. 

15 

5 

no 

10 

to 

20 

Non-del. 

35 

45 

SO 

40 

40 

SO 

50 

50 

100 

50 

50 

too 

Q 

«*  .59 

Q-0 

the.  corresponding  deficiency.  There  is  therefore  a positivo  affinity  be- 
tween boys  nnd  delinquency.  However,  since  the  excess  of  15  over  10 
is  fairly  moderate,  the  index  reveals  only  a moderate  association:  Q « .59. 
But  it  is  the  amount  of  this  excess  that  constitutes,  by  definition,  the 
degree  of  association.  This  difference  between  the  observed  and  ex- 
pected frequencies  is  conventionally  symbolized  as  (0  - B)  — an  expres- 
sion that  has  achieved  an  almost  epigrammatic  currency  in  statistical 
methods  of  investigation,  which  the  student  will  learn  to  appreciate. 

Signt.  Although  mathematically  the  formula  for  Q will  yield  a 6lgn,  It 
may  not  seem  very  meaningful  when  applied  to  attributes  such  as  sex, 
religion,  or  race,  which  do  not  constitute  a hierarchy.  In  the  foregoing 
illustration,  therefore,  one  may  say  that  male  ness  is  positively  associated 
with  dr'inquc’cy,  but  it  cannot  be  verbalised  as  "the  higher  the  sex,  the 
higher  the  delinquency  rate.”  Nevertheless,  we  may  be  guided  by  the 
algebraic  sign  which  always  refers  to  the  association  indicated  In  cells 
a and  d.  When  the  «;0n  is  positive,  the  joint  frequencies  of  the  attri- 
butes intersecting  in  cells  a and  d exceed  chance  (and  are  therefore  linked 
together);  when  the  sign  is  negative,  the  frequencies  In  cells  a and  d 
ore  less  than  would  toe  expected  by  chance,  and  therefore  "repel”  each 
other.  By  corollary,  cells  5 and  c are  read  in  the  opposite  manner.  The 
assumption  is,  of  course,  that  lire  alphabetical  designations  of  cells  are 
made  in  the  conventional  order. 

Tup*  of  Do 'a  lo  Which  Q It  Applicable.  While  Q is  singularly  appropriate 
for  genuine  attributes,  nevertheless  this  measure  may  at  times  be  applied 
to  continuous  variables  by  prudently  cutting  them  into  dichotomies. 
For  example,  persons  may  be  tub-classified  as  under  and  over  31  years, 
and  responses  may  be  either  positive  or  negative,  although  these  latter 
usually  shade  into  each  other  (see  Table  9.2.7). 

There  are,  however,  two  cautions  which  should  always  bo  observed 
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Table  9.2.7 
Opinion  by  Aye 


Age 


-21 
21  + 


Opmion 
Yes  No 
% % 


CO 

40 

40 

60 

100  100 


In  such  dichotoniization.  (1)  Such  compression  of  a mass  of  continuous, 
detailed  observations  into  two  broad  dichotomies  may  be  an  expensive 
waste  of  perfectly  good  data,  and  naturally  reduces  the  potential  pre- 
cision of  the  resulting  nuasutc.  Instead  of  cavalierly  discarding  costly 
accuracy,  it  might  be  advisable  to  use  other  available  measures  which 
can  lake  such  precision  into  account.  (2)  A less  obvious  trap  for  the 
unwary  operator  is  the  more  or  less  arbitrary  location  of  the  cutting 
point,  which  introduces  cn  unpredictable  effect  upon  the  ultimate  Index. 
If,  In  the  above  table,  the  culling  point  had  been  set  at  30  years  instead 
of  21,  the  cell  frequencies  might  have  been  radically  changed,  and  the 
Q-measurc  substantially  altered.  Such  aibitrary  decisions  gravely  reduce 
the  reliability  and  comparability  of  any  measure,  including  Q.  The 
student  will  remember  the  general  principle  that  all  grouping  has  its 
haiards;  but  when  everything  is  staked  on  one  cutting  point,  the  pre- 
cariousness of  the  undertaking  is  increased. 

The  usefulness  of  Q also  varies  with  the  general  pattern  of  distri- 
bution of  the  data.  These  patterns  may  be  particularized  as  follows: 
(1)  the  division  of  frequencies  within  each  marginal  set;  (2)  the  similarity 
(dissimilarity)  between  the  two  marginal  sets;  and  (3)  the  Internal  dis- 
tribution of  cell  frequencies.  Thus,  an  individual  marginal  set  maybe  sym- 
metrical (equal  division)  or  skewed;  the  two  marginal  sets  may  be  more 
or  less  Identical  in  degiee  of  symmctiy;  and  the  cell  frequeade*  may 
be  approximately  evenly  dispersed  in  alt  four  cells,  concentrated  in  three 
cells,  or  in  two  diagonal  cel’s  (see  Tables  0.2.$a  and  0.2.8b). 

It  will  become  increasingly  evident  that  Q is  most  useful,  and  least 


TaMe  9-2.Sa  Like  Marginal  Sets 


30 

23 

33 

5 

2-7 

0 

30 

30 

23 

23 

SO 

?0 

3 

75 

30 

0 

30 

33 

30 

103 

10 

23 

\'Y) 

so 

30 

too 

O 

ERIC  • 
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Table  9.2.8b  Unlike  Marginal  Sds 


50 

0 

50 

6 

to 

45 

25 

25 

50 

Jo 

40 

55 

75 

25 

too 

20 

SO 

100 

competitive  with  other  measures  of  association,  in  those  situations  where 
the  cases  fall  predominantly  in  three  of  the  four  cells.  Such  a distribu- 
tion, which  is  evidence  of  a one-way  association,  is  possible  only  when 
marginal  sets  are  unlike.  Hence,  Q is  a likely  choice  whenever  marginal 
rets  are  dissimilar.  The  reason  for  this  will  become  clearer  after  ^ and 
other  indexes  of  association  have  been  studied. 

Interpretation  of  Yule't  Q.  For  all  its  apparent  simplicity  and  precision, 
Q possesses  no  simple  quantitative  meaning,  and  is  not  convertible  into 
a specific  prediction.*  For  example,  if  the  association  between  sex  and 
delinquency  is  .5,  one  cannot  say  that  50  per  cent  of  a given  sex  is  delin- 
quent, or  that  one  sex  Is  twice  ns  delinquent  as  another,  or  that  sex  ac- 
counts for  50  per  ccut  of  all  delinquency.  A Q-valuc  of  .8  is  not  twice 
as  strong  as  .4,  although  it  is  stronger  than  all  values  less  than  .8. 

Not  even  is  a coefficient  of  1.00  wholly  unambiguous.  In  the  previous 
example,  the  perfect  Q did  not  depend  on  the  tendency  of  boys  to  become 
delinquent,  but  mther  on  the  relative  tendency  that  a delinquency  be 
committed  by  a boy.  Actually,  in  absolute  terms,  tho  tendency  of  a 
boy  to  be  delinquent  could  bo  very  weak,  and  Q still  be  1.00,  provided 
that  no  girls  are  delinquent.  A coefficient  of  1.00  therefore  registers  the 
presence  of  one-way  association,  but  docs  not  reflect  the  degree  of  pre- 
diction in  the  other  direction,  although  that  too  may  be  complete.  Wo 
may  say,  therefore,  that  Q is  primarily  a measure  of  one-way  asso- 
ciation. 

In  general,  it  would  seem  to  be  sound  practice  to  quote  the  original 
2X2  tabulation  along  with  the  calculated  value  of  Q.  Not  only  is  the 
table  necessary  in  order  to  identify  linhugi?  between  specific  attributes; 
it  also  pcimits  the  render  to  examine  for  himself  the  detailed  structure 
of  the  relationship.  But  if  it  is  a single  index  that  is  required,  theu  one 
must  accept  the  limitations  of  such  a condensation,  just  as  the  mean 
was  accepted  as  a limited  representative  of  the  full  array. 

•A  precise  probaVliilie  Interpretation  of  0 has  recently  been  formulated.  See 
L A.  CKxh1.t_.  1 tod  \\\  K.  Krusknl,  "Measures  of  A -socb.tii.i  for  Cross  CtasrifWatk»t.«,” 
Jiwnv l of  At  An*ri<an  SttlitUtol  Attttiotitn,  XI.IX,  1951,  p.  7SO 
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Section’  Two 

Scat  t<r  Diagram:  and  Cent}  at  ion  Table 

In  the  previous  section,  we  were  concerned  with  the  problem  of  meis- 
uring  the  amount  of  agreement  between  ranks,  or  ordinal  measures. 
However,  there  are  numerous  concepts  in  sociology  that  are  subject  to 
interval  measurement:  for  example,  wo  may  measure  fertility  in  terms  of 
the  number  of  births  per  100  women  or  wages  in  terms  of  dollars  and 
cents.  Wien  the  data  being  correlated  consist-  of  scaled  variables,  we 
naturally  employ  techniques  that  arc  appropriate  to  that  typo  of  quantU 
tative  data.  While  these  correlational  techniques  differ  somewhat  from 
those  previously  presented,  they  all  answer  to  the  same  purpose:  namely, 
to  express  as  precisely  as  possible  the  degree  of  relationship  between  two 
or  more  variables.  The  ensuing  discussion  is  restricted  to  the  most 
prevalent  of  all  correlation  measures:  the  product-moment  correlation 
coefficient  (r)  and  the  correlation  ratio  (t?). 

The  Scatter  Diagram.  The  measurement  of  covariation  can  be  opproaehed 
in  a preliminary  manner  by  means  of  the  scatter  <iiagramt  which  h related 
to  birartofe  data  ; ' much  the  same  way  as  the  histogram  is  to  univariate 
data.  It  reve  at  a glance  the  entire  disposition  of  items,  thereby 
enabling  us  \<  . \ ivc  at  a rough  but  useful  estimate  of  the  strength  of 
the  correlation  l ore  wo  actually  measure  it.  This  indispcttsablc  device 
has  already  been  employed  to  depict  the  trend  of  a time  series  (p.  99) 
and  the  degree  of  correspondence  between  ranks  (p.  274);  however,  the 
details  of  its  construction  have  not  yet  been  elaborated. 

To  illustrate  more  fully  its  construction  atul  use,  wc  take  as  our  point 
of  departure  Table  10.2.1,  which  presents  yearly  income  averages  and 
suicide  nites  for  the  U.S.  Central  states,  10211-1030.  To  reduce  the  data 
of  this  table  to  a scatter  diagram,  wc  fir-t  draw  horizontal  and  vertical 
axes,  os  in  the  construction  of  a histogram.  Axes  are  drawn  approxi- 
mately equal  in  length,  unless  there  is  good  reason  to  deviate  from  th‘i 
convention.  Also,  as  a matter  of  convention,  the  independent  X-variablo 
is  plotted  along  the  base  line,  and  lie  dependent  Invariable  along  the 
vertical  axis.  The  establishment  of  oi  e variable  as  the  independent,  and 
the  other  as  dependent,  is,  of  course,  rot  a statistical  problem,  but  rather 
a matter  of  judgment  and  circumstance.  The  dependent  variable  may 
be  construed  os  the  effect  of  the  independent  variable  as  cause,  or  the 
outcome  to  be  predicted  from  the  predictor  variable.  In  many  instances, 
there  may  be  no  clear  "causal''  dependency  at  all,  since  both  variables 
may  be  viewed  as  the  consequence  of  an  unidentified  third  factor. 

Seales  arc  next  established  on  the  aw*  in  such  a manner  as  to  aceom- 
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Table  10.2.1 

5u/ctWe  /?alc  6i/ 

Per  Capita  Income,  Cat - 
fra!  IAS.,  1929-1950 


i 


Year 

| Plf  Capita 
Incomb 

m 

StHCtbE 

Rate 

<n 

1929 

G04 

15.4 

1030 

532 

17.9 

1031 

ISO 

19.1 

1932 

399 

19.3 

1933 

100 

18.1 

1931 

437 

16.9 

1935 

4S2 

15.5 

1930 

519 

15.2 

1937 

507 

16.3 

1935 

515 

16.7 

1939 

506 

15.3 

1910 

602 

15.2 

1911 

70S 

14.0 

1912 

SOO 

12.8 

1013 

911 

11.0 

1911 

072 

11.0 

1915 

9S5 

12.3 

1910 

951 

. 123 

1917 

S97 

12. 1 

1915 

92S 

11.6 

1919 

STS 

11.9 

1950 

917 

11.9 

Source:  Doruld  S'iicilc  11  /W.tffo/i  l* 

t/rto^iVah*»n  C':d  Ru*.  Vnnubf1#htd  Thwli* 

Department  of  S^cioJo;)*.  Indiana  Utivtmtjr,  ltof 
Tiblo  5. 


modatc,  with  a margin  to  spare,  the  observed  ranges  of  ttio  respective 
variables.  Thus,  the  horizontal  scale  covers  the  distance  from  $200  to 
$1,200,  while  the  vertical  scale  extends  from  10  to  20.  Needless  to  say, 
enough  markers  arc  set  up  on  each  axis  to  ensure  accurate  and  effortless 
plotting.  Unlike  the  histogram,  the  vertical  scale  in  a bivariate  plot  need 
not  begin  with  zero,  for  the  reason  that  the  focus  of  attention  is  on  the 
contour  of  the  scatter  rather  than  on  relative  frequency  as  gauged  by 
the  height  of  the  curve.  In  this  respect,  the  scatter  diagram  is  analogous 
to  the  semi-log  time  chart,  whose  meaning  likewise  inh.'res  in  its  shape 
rather  than  location. 

Having  drawn  and  sealed  the  axes,  we  are  ready  to  plot  each  pair 
of  values  by  a double-duty  point.  For  any  pair  of  variates,  the  K*valuo 
fixes  the  height  of  the  pou  t above  the  b v*e  line  and  the  ,V*value  fixes 
its  horizontal  distance  from  the  vertical  a\i*  Thu*.  1910  is  reprobated 
in  Figure  10.2.1  by  a point  located  at  the  inter- r :ii on  of  guide  lines  cx* 
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tending  perpendicularly  from  Y « 15.2  and  X = C02;  similarly,  1950  is 
represented  by  a point  at  the  infciieclio’i  of  It. 9 and  947.  The  swarm 
of  all  such  points  constitutes  the  scatter  diagram. 

Types  of  Scalier,  It  is  the  pattern  of  this  swarm  that  enables  us  to  judge 
the  nature  of  the  relationship  — and  such  a judgment  is  an  essential 
preliminary  to  its  proper  measurement.  Thus,  it  appears  that  a fixed 
increase  in  income  is  accompanied  by  a fixed  decrease  in  the  suicide  rate; 
that  is,  the  suicide  rate  changes  by  a constant  amount  per  unit  income. 
Such  a relation  is  termed  rttlitintar,  or  simply  linear,  because  the  trend  of 
scatter  conforms  to  the  track  of  a straight  line. 

Any  such  trend  line,  whether  freehand  or  mathematically  fitted,  Is 
technically  termed  a fine  of  regression.  This  concept  was  coined  by 
Galton  in  18??,  who  used  it  in  connection  with  his  correlational  studies 
of  the  characteristics  cf  parents  and  their  offspring.  He  perceived  that 
such  a line  effectively  expressed  the  tendency  of  children  to  "regress1' 
to  the  average  level  of  the  parents  in  a wide  variety  of  trails.  The  term 
has  survived  and  enjoys  a wide  currency,  although  it  i3  do  longer  re- 
stricted to  its  original  connotation. 
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Furthermore,  the  relation  in  Figuro  10.2.1  is  said  to  be  inverse,  since 
the  two  series  move  in  opposite  directions:  as  income  rises,  the  suicide 
rate  falls.  If  suicide  and  income  had  risen  together,  then  the  trend  of 
points  would  have  been  upward,  reading  from  left  to  right.  Such  a 
sloping  trend  line  (Figure  10.2.2)  is  evidence  of  a direct  linear  relation* 


FARM  ACr.F  A OS  (,Y) 

FlOCRE  10.2.2  Family  Siti  by  Perm  Acrtajt 

ship  between  two  variables;  family  sire  increases  as  farm  acreage  in- 
creases. 

The  trend  of  the  scatter  will  not  always  be  linear;  rather  it  may  be 
curriUntar  and  take  on  any  one  of  innumerable  curve  patterns.  A simple 
example  is  provided  in  Figure  10.2.3,  which  portrays  the  relation  between 
favorableness  of  attitude  toward  a minority  people  and  Intensity  of 
feeling.  As  might  be  anticipated,  a decided  opinion  — whether  pro  or 
con  — is  held  with  considerable  intensity  of  feeling,  while  a less  decided 
or  neutral  opinion  ntou*es  no  very  strong  feelings.  Whether  this  pattern 
— strong  opinion,  strong  afTcct;  weak  opinion,  peak  affect  — holds  under 
all  conditions  is  not  here  our  concern.  That  would  be  a matter  for  em- 
pirical investigation.  Here  we  i re  merely  interested  in  exhibiting  a type 
of  relationship  which  is  fairly  common  in  sociological  studies. 
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Figure  10.2.3  Scatter  Diagram,  fr.Unsily  of  Foiling  by 
Favor  abkntu  of  Attitude  ( llypo’.kitico ! Data) 


The  three  scatter  diagrams  presented  thus  far  have  distinctive  trends 
and  one  would  have  little  hesitancy  hr  describing  them  as  exclusively 
linear  or  curvilinear.  But  scatters  of  empirical  observations  are  seldom 
so  clean  and  unambiguous;  moie  often  bo»h  linear  and  curvilinear  lend* 
encies  combine  in  the  same  set  of  data  and  thereby  complicate  the  prob- 
lem of  representing  correlation  by  on  over-all  measure.  For  example, 
the  scatter  of  delinquency  rates  and  average  monthly  rentals  for  140 
small  census  tracts  In  the  city  of  Chicago  (Figure  10.2.4)  Is  marked  by 
some  linearity,  yet  it  appears  that  a curved  trend  line  would  better  fit 
the  entire  scatter.  From  left  to  right,  as  rentals  increase,  the  delin- 
quency rate  responds  by  decreasing,  but  at  a progressively  slower  rale. 
This  is  evidenced  by  the  straightening  of  the  swarm.  There  ate  of  course 
several  striking  exceptions  to  the  foregoing  gcncraliration  — a few  ex- 
tremely high  delinquency  rates  occur  with  above-average  rental*.  These 
mavericks  would  require  special  analysis,  since  they  represent  a break- 
down in  the  “law.”  Yet,  in  the  main,  the  law  of  relationship  holds  fairly 
‘ well,  affording  a measure  of  predictability  of  one  variable  from  the  other. 
If  we  knew,  for  example,  the  rental  to  be  •'ICO,  wo  would  predict  the  delin- 
quency rate  lobe  approximately  3.0,  which  is  the  height  of  the  free-hand 
rtgrettion  cu ret  at  that  point.  To  be  sure,  such  a piediction  would  not 
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Fiouf.e.  10.2.4  Scatter  Diagram  FrahAnd  Regression  Line,  0c- 
lin^uencg  Rote  by  Monthly  Rmict,  1^0  focal  Aw?,  Chicago,  1930 


be  free  of  error,  for  the  obvious  reason  that  none  of  the  observed  values 
fall  right  on  the  curve  at  that  point  — all  deviate  to  a greater  or  lesser 
extent.  Evidently,  the  accuracy  of  any  such  predictio*  would  vary 
according  to  the  tendency  of  the  points  to  hug  the  line  of  relationship 
between  the  two  series.  When  the  points  move  within  a narrow  lane, 
predictive  accuracy,  and  therefore  coi relation,  would  be  high;  when  the 
points  are  widely  scattered,  predictive  accuracy  would  be  correspond* 
ingly  low.  Only  when  all  points  fall  right  cn  the  regression  line  would 
prediction  and  correlation  b?  perfect  At  the  other  extreme,  when  the 
scatter  is  puicly  random,  then  v.e  may  ju  t ts  well  ignore  the  so-called 
"predictor  variable."  lor  any  or  nil  rental?,  our  best  guess  would  be 
the  over-all  mean  of  the  delinquency  rates, 

SccdasUcity.  Knowing  the  rental  to  \ wo  still  couhl  not  accurately 
forecast  the  level  of  delinquency,  since  the  deli: .q1. Jersey  values  are  widely 
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scattered  for  that  value;  on  the  other  hand,  for  a rental  of  STD,  we  could 
predict  with  much  greater  accuracy,  owing  to  the  bunching  of  delinquency 
rales  around  the  regression  line.  This  scatter  of  K- values  for  the  respec- 
tive .Y-values  is  known  as  scafoclfcilj.  If  the  degree  of  variation  in 
delinquency  rates- — the  width  of  the  scatter  band — had  been  uniform 
for  all  values  of  X,  then  we  could  have  spoken  of  Y as  being  homoscc- 
dastic  in  respect  to  A*.  Actually,  the  degree  of  scatter  in  Y diminishes 
as  X changes,  so  that  Y is  hctcrosnJatUc  in  respect  to  A'.  Heterosce- 
dasticity  implies  that  the  degree  of  correlation  is  not  uniform  throughout 
the  entire  scries;  hence,  its  presence  reduces  the  feasibility  of  a single 
over-all  measure  of  correlation,  which  is  after  all  an  average.  Just  os 
we  hesitate  to  compute  the  mean  of  heterogeneous  bimodal  data,  similarly 
we  hesitate  to  calculate  an  average  measure  of  correlation  of  a hcto.rosce- 
dastic  scatter. 

Hclerosccdasticity  may  be  even  more  glaring  than  that  exhibited  in 
Figure  10.2.4;  the  scatter  may  be  gourd-shaped,  dumbbell-shaped,  or 
J-‘hapcd,  as  in  Figure  10.2.5.  These  oddly  shaped  scatter  diagrams  by 


Figure  10.2.5  Srfuhd  T s of  SccAiif 

no  means  exhaust  the  variety  of  types  that  may  be  encountered  in  prac- 
tical work.  However,  they  do  serve  to  ratify  the  utility  of  this  visual 
aid.  Although  the  scatter  diagram  yields  no  mathematical  measure  of 
coirelation,  it  does  indicate  (a)  whether  the  relationship  is  simply  recti- 
linear or  more  complex,  (b)  whether  cr  not  the  relationship  is  consistent 
over  the  entire  range,  and  (c)  whether  the  relation  is  strong  or  weak. 
It  is  an  indispensable  tool  and  plays  the  same  role  in  correlation  ns  does 
the  frequency  graph  in  the  processing  of  univariate  data,  ft  provides 
a bi.d’s-eyo  view  of  the  whole  distribution. 

The  Joint  Frequency  TaUe.  Instead  of  plotting  the  individual  items  of 
the  bivariate  data,  we  may  group  them.  Such  grouping  answers  the 
general  purposes  of  all  grouping:  (1)  to  reveal  the  basic  pattern  of  dis- 
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tribution,  and  (2)  to  facilitate  the  statistical  processing  of  the  data.  In 
grouping  bivariate  data,  we  classify  each  case  simultaneously  in  two  class 
intervals,  thereby  locating  each  case  at  the  intersection  of  a given  row 
and  given  column.  Hence,  it  is  ns  though  we  superimposed  a grid  on 
the  scatter  diagram,  counted  the  points  in  each  cell  and  inserted  tho 
corresponding  number.  Such  an  operation,  applied  to  the  scatter  of 
delinquency  rates  and  average  rentals  (Figure  10.2.0),  would  yield  the 
joint  frequencies  of  Tabic  10.2.2. 
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Naturally,  in  any  real  situation  we  would  not  proceed  in  this  some- 
what fanciful  manner;  rather,  we  would  immediately  tally  the  unarrnyed 
bivariate  items  in  the  giid  set  up  for  that  purpose,  and  then  count  tho 
joint  occurrences  in  each  cell 
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Joinl  Frequency  Table;  Delinquency  Rales  by  Average 
Table  10.2,2  Monthly  Rcntah 


Delink 

QVENCY 

Rate 

(F) 


Average  Moxthly  Rental  rs  Dollars  (?) 

10-  20-  30-  40-  50-  CO-  70-  80-  90- 
0-9  19  29  39  49  59  69  79  89  99 
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Grouping  Procedure.  To  clarify  this  grouping  procedure,  we  list  the 
directives  observed  in  the  construction  of  Table  10.2.2. 

1.  Select  a suitable  class  interval  for  each  variable. 

2.  Mark  off  these  class  intervals  on  properly  drawn  axes,  and  from 
these  markers  extend  guide  lines  in  such  fashion  as  to  create  a gridwork 
of  cells.  Each  cell  is  taken  to  represent  the  junction  of  two  class  intervals. 

3.  Locate  each  pair  of  values  in  its  proper  cell  and  indicate  its  presence 
there  by  a tally* 1 2 3 4 5 6"  mark. 

4.  Count  tally  marks  in  each  cell  and  replace  tally  by  that  number. 
Any  such  number  represents  a joint  frequency  — i.e.,  the  frequency 
with  which  two  class  values  or  midpoints,  cceur  together. 

5.  Sum  by  rows  and  columns  ;n  order  to  obtain  the  marginal  totals. 
These  marginal  totals  of  course  constitute  the  simple  frequency  distri- 
butions of  the  individual  variables. 

6.  Sum  marginal  frequencies  to  obtain  the  grand  total  of  cases,  N , the 
one  sum  serving  as  a check  on  the  other. 

In  retrospect,  we  see  that  the  construction  of  the  joint  frequency  table 
proceeds  according  to  well-established  principles:  the  classification  should 
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be  fine  enough  to  reveal  th?  shape  of  the  joint  frequency  distribution, 
but  not  so  fine  that  some  rows  and  columns  are  completely  vacant.  But 
if  one.  is  to  err  at  the  outset,  it  is  perhaps  preferable  to  have  too  many 
cells  than  too  few.  The  optimum  arrangement  will  be  more  apparent 
in  the  light  of  too  much  detail  rather  than  too  little,  and  it  is  usually 
easier  to  amalgamate  cell  frequencies  than  to  partition  them. 

Function  of  the  Joint  Frequency  Table.  Wo  must  acknowledge  that  a 
joint  frequency  table  cannot  depict  the  type  of  relationship  so  vividly 
as  the  sealtergrew i,  for  the  reason  that  numerals  cannot  convey  so  effec- 
tively gradations  of  density  as  does  the  scatter  of  dots.  Nevertheless,  in 
spite  of  its  relative  coarseness,  it  will  often  be  more  than  adequate  for 
purposes  of  exhibiting  the  pattern  of  relationship  and  assuring  its  com- 
prehension. 

. Moreover,  the  column  and  row  distributions  of  the  joint  frequency 
table  permit  a statistical  measurement  of  scedasticity  which  would  bo 
impossible  to  obtain  from  a raw  scatter  diagram.  For  such  an  assessment, 
we  need  only  compute  the  standard  deviation  of  each  column  or  row. 
A close  similarity  among  such  standard  deviations  would  be  evidence  of 
homoscedasticity,  whereas  pronounced  differences  would  suggest  hetero- 
scedasticity. 

Furthermore,  the  margined  distributions  of  the  J-  and  F-variables  — 
which,  of  course,  are  integral  features  of  the  joint  frequency  table  — 
also  supply  important  and  recognizable  clues  to  the  possible  degree  of 
association  between  the  plotted  variable*.  Specifically,  the  marginal 
distributions  set  limits  to  the  degree  of  obtainable  correlation.  For 
example,  unlike  marginals  preclude  a perfect  linear  relationship.  This 
merely  generalizes  the  earlier  observation  in  respect  to  the  ^-coefficient 
that  perfect  two-way  association  in  contingency  tables  is  impossible  so 
long  as  marginal  sets  are  not  identical.  This  general  principle  is  illus- 
trated by  Table  10.2.3,  whose  marginals  make  it  impossible  to  locate 
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all  case?  aloug  one  diagonal,  as  would  he  required  in  perfect  rectilinear 
correlation.  No  matter  how  we  deploy  the  cases,  we  cannot  force  them 
onto  one  diagonal,  as  the  student  should  experimentally  demonstrate  for 
himself.  In  general,  the  marginals  always  act  to  constrain  in  one  way 
or  another  the  type  and  degree  of  association  between  paired  variables; 
hence,  it  is  always  necessary  to  examine  them  in  order  to  determine  the 
limits  which  they  impose,  and  the  prospects  for  the  valid  employment 
of  a given  correlation  index. 

Additionally,  the  joint  frequency  table  may  serve  as  a computing  aid 
which  may  be  called  on  especially  when  the  number  of  instances  is  large 
or  when  computing  machines  are  not  available.  Finally,  it  is  a pre- 
requisite to  the  simplest  approach  to  curvilinear  correlation,  as  will  be 
evident  in  a later  section. 

This  section  has  stressed  the  important  role  of  the  scatter  diagram  and 
the  joint  frequency  table  in  the  preliminary  analysis  of  covariation. 
From  these  charts  it  is  possible  to  determine  whether  the  relationship 
is  (a)  linear  or  curvilinear;  (b)  direct  or  inverse;  (c)  weak  or  strong;  ad- 
ditionally, (d)  whether  the  variables  are  homoseedastic  in  respect  to  each 
other;  (c)  whether  there  are  notable  exceptions  to  the  main  trend  or 
"law  of  relation,"  and  (f)  whether  the  marginal  distributions  are  sym- 
metrical or  skewed,  and  approximately  matched.  Because  of  these 
significant  disclosures,  it  is  mandatory  that,  prior  to  the  measurement 
of  correlation,  a scatter  diagram  or  joint  frequency  tabic,  or  even  both, 
be  constructed  and  carefully  studied.  Without  these  visual  aids,  we  fall 
into  the  danger  of  using  improper  procedures  and  thereby  arriving  at 
misleading  conclusions. 

Questions  and  Problems 

1*  Define  the  following  concepts: 

Scatter  Diagram 

Bivariate  Data 

Rectilinearity 

Scedasticity 

Curvilmearity 

Joint  Frequency  Table 

Marginal  Distributions 

2.  If  the  marginal  distributions  arc  normal  in  shape,  will  the  scatter  necessarily 
be  homoseedastic?  Illustrate  and  explain. 

3.  Is  a perfect  rectilinear  scatter  possible,  if  marginal  distributions  are  unlike? 
Illustrate  your  answer  by  table  or  graph. 

4.  Docs  the  presence  of  homosccdasticUy  in  a scatter  guarantee  linearity?  Illus- 
trate by  sketch. 

o.  Docs  perfect  rectilinearity  in  a scatter  require  identical  marginals?  Illustrate. 
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Table  10.2.4 

Clothing  Expenditures  by 
Total  Family  Expendi- 
tures, U,S. 


Total 

EXPENDITURES 
{'00  ov  Dollars) 
(X) 

Clothino 
Expenditures 
(Per  Cent  of  Total) 

(F) 

9 

5 

10 

8 

14 

12 

18 

13 

20 

14 

24 

15 

2S 

16 

34 

18 

43 

20 

Table  10.2.5 


Per  Cent  Democratic  Vote 
by  Voter  Registration, 
Selected  Stales,  Congres- 
sional Elections,  1952 


State 

Democratic  Vote 
(%  of  Total) 

Per  Cent 
Registered 

Ariss. 

52 

78 

Ark. 

85 

50 

Cal. 

44 

88 

Colo. 

44 

91 

Conn. 

46 

91 

FJa. 

74 

75 

Ga. 

100 

59 

Idaho 

41 

‘ 96 

Ind. 

43 

96 

La. 

91 

67 

Md. 

43 

62 

Mas3. 

46 

SS 

Mont. 

43 

83 

Ncv. 

50 

81 

N.H. 

37 

92 

N.J. 

42 

85 

N.Y. 

40 

80 

Ohio 

44 

59 

Ore. 

39 

87 

Pa. 

43 

78 

R.I. 

54 

87 

S.C. 

93 

49 

S.D. 

31 

83 

Yt. 

23 

88 

Va. 

67 

33 

Wash. 

43 

92 

Source:  US.  r»ur«v.u  of  the  Cen*mr  S&jliifimr  Attract 
of  the  U.S.,  2P33,  U.3.  Government  Printing  Office, 
Washington,  D.C.,  1052 
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6.  If  a scatter  is  homosecoastic,  docs  it  follow  that  the  relationship  is  close? 

7.  (a)  Construct  a scatter  diagram  for  the  data  in  Table  10.2.4. 

(b)  Formulate  in  your  own  words  the  “ law  of  relation,” 

S.  (a)  Plot  a scatter  diagram  for  the  following  data  of  Table  10.2,5. 

(b)  Characterize  the  relationship.  Docs  a preponderant  one-party  vote  appear 
to  depress  voter  registration? 

9.  Construct  the  scatter  diagram  for  the  delinquency  and  recidivism  rates 
{Table  10.2.0).  How  would  this  trend  be  described  in  words? 


Table  10.2.6 

Delinquency  and  Recidivism 
Rotes,  Chicago , 1929 


Detjnqukxcv 
Rate  pek  100 

Recidivism 

Rate 

0.5 

26.1 

1.5 

26.6 

2.5 

35.1 

3.5 

30.1 

4.5 

38.7 

5.5 

54.8 

6.5 

42.9 

7.5 

54.7 

3.5 

32.4 

9.5  . 

52  3 

10.5 

66.7 

11.5 

50.2 

12.5 

59.9 

13.5 

65.9 

15.5 

53.6 

16.5 

80.3 

19.5 

61.1 

Source:  Clifford  R.  Shaw*,  Delinquency 

Area  The  University  of  Chicago  Press, 
1920.  T&ble  21,  p,  181. 


10.  (a)  Plot  a scatter  diagram  from  Table  10.2.7. 

(b)  What  does  the  scatter  of  this  graph  suggest  concerning  the  11  pull”  of 
economic  conditions?  Explain  the  two  fairly  distinct  clusters. 

11.  Construct  a joint  frequency  table  for  the  data  in  Table  10,2.7.  Divide  Income 
Index  into  ten  class  intervals  of  width  10,  lower  rounded  limit  50;  divide 
Per  Cent  Bom  in  Other  States  into  ten  class  intervals  of  width  10,  lower  limit  0. 
Tally  marginal  frequencies. 

12.  Sketch  the  marginal  distributions  which  would  yield  the  scatters  shown  in 
Figure  10,2.7. 
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Covariation:  Quantitative  Variabi.es 

Table  10.2.7  Per  Cent  Born  in  Other  Slates  by  Income  Index,  U.S. 


State 

Index  of 
Income  * 

% Born 
in  Other 
States 

State 

Index  or 
Income 

% Born 
in  Other 
States 

Ala. 

62 

11 

Neb. 

96 

25 

Arir. 

91 

44 

Nev. 

137 

68 

Ark. 

58 

22 

N.H. 

99 

30 

Cal. 

124 

58 

N.J. 

120 

32 

Col. 

99 

50 

N.  Mew 

81 

44 

Conn. 

127 

28 

N.Y. 

124 

17 

Del. 

138 

35 

N.C. 

64 

12  ’ 

Fla. 

80 

54 

N.  Dak. 

75 

23 

Ga. 

69 

14 

Ohio 

115 

24 

Idaho 

88 

49 

Okla. 

78 

39 

III. 

121 

25 

Ore. 

100 

57 

Ind. 

103 

25 

Penn. 

104 

12 

Iowa 

94 

19 

R.I. 

101 

24 

Kansas 

104 

32 

S.C. 

67 

11 

Ky. 

69 

12 

S.  Dak. 

77 

29 

La. 

74 

15 

Ten  a. 

69 

20 

Me. 

83 

12 

Texas 

89 

21 

Md. 

107 

34 

Utah 

88 

21 

Mass. 

107 

15 

Vt. 

82 

22 

Mich. 

111 

28 

Va. 

81 

25 

Minn. 

91 

21 

Wash. 

110 

54 

Miss. 

50 

11 

W.  Va. 

75 

17 

Mo. 

97 

25 

Wis. 

101 

15 

Mont. 

104 

44 

Wyoming 

9S 

60 

• Ratio  of  atato  per  capita  income  to  national  per  capita  income. 

Source:  U.S.  Bureau  of  the  Census,  Statistical  Abstract  of  the  U.S,,  t&34,  XfJS.  Government 
Printing  Office,  Washington,  D.C.,  1934,  Tabtc3  37  and  337. 


Fick/*re  10.2.7  Selected  Scatter  Patterns 


o 
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Section"  Three 

Linear  Correlation  of  Ttvo  Variables 

The  Need  for  an  Over-All  Mcavu'c  of  Correlation.  The  scatter  diagram, 
which  was  elaborated  in  the  preceding  sect  Ton,  reveals  to  the  eye  whether 
two  variables  change  together  in  a systematic  manner.  The  cluster  of 
plotted  points  around  the  hypothetical  regression  line  suggests  the  “law 
of  relation”  between  the  two  variables.  Moreover,  by  observing  the 
width  of  the  scatter,  we  can  make  at  least  a preliminary  judgment  as 
to  how  well  the  cases  conform  to  this  hypothetical  law.  Such  conclusions 
are  often  of  considerable  value;  yet  they  stilt  leave  something  to  be  de- 
sired, for  they  are  impressionistic,  subjective,  and  unstandardized.  Con- 
sequently, they  cannot  be  described  or  communicated  to  anyone  else 
without  reproducing  the  chart  — or  at  least  engaging  in  an  extended 
account  of  it.  This  is,  of  course,  impractical.  And  since  these  “eye- 
measures”  are  not  accurate  in  u mathematical  sense,  comparisons  with 
similar  measures  arc  impossible  even  if  all  the  intricate  charts  are  avail- 
able. What  we  need  therefore  is  an  objective,  standard,  synoptic  measure 
of  the  relation  between  the  two  variables.  This  will  finally  be  the  measure 
of  correlation. 

In  principle,  there  is  of  course  no  difference  between  rough  measures 
made  by  the  eye  and  those  computed  by  a mathematical  operation.  The 
latter  are  merely  move  precise  and  serviceable.  What  \vc  see  is  a hypo- 
thetical trend  line  to  which  the  swarm  of  points  move  or  less  conforms 
and  from  which  wc  deduce  the  degree  of  correlation.  But  in  order  to 
measure  this  correlation  wc  must  (1)  establish  exactly  the  position  of 
such  a line,  (2)  measure  the  amount  of  conformity  thereto,  and  (3)  trans- 
late this  result  by  a prescribed  method  into  an  index  of  correlation.  Our 
first  concern,  then,  would  be  with  the  location  of  the  line  which  best 
fits  the  observed  da  ta. 

As  we  have  previously  learned,  a relation  may  be  rectilinear  or  curvi- 
linear; hence,  the  best-fitting  line  may  be  straight  or  curved.  In  this 
section,  we  restrict  ourselves  to  those  relations  which  arc  rectilinear,  or 
approximately  so,  and  which  can  therefore  be  legitimately  represented  by 
a straight  line.  In  other  words,  our  concern  will  be  with  those  relations 
in  which  a constant  change  in  one  variable  produces,  on  the  average,  a 
constant  change  in  the  other  paired  variable.  Such  a relation  has  already 
been  illustrated  in  Figure  10.2.2,  which  plots  the  size  of  family  and  the 
size  of  farm  in  China:  family  sire  increases  by  one  person  on  the  average 
as  farm  size  increases  by  ten  acres. 

Here,  for  purposes  of  exposition,  let-  us  use  some  very  simple  data 
which  exemplify  the  relation  between  income  and  social  status  score 
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(Tabic  10.3.1a)*  It  is  reasonable  to  designate  social  status  as  the  depend- 
ent F-variable,  since  status  appears  to  adjust  itself  to  income  in  the 
long  run,  rathei  than  the  reverse.  Once  cases  have  been  plotted  to  form 
a scatter  diagram  (Figure  10.3.1),  v e wish  to  draw  the  most  representa- 
tive straight  line. 


Tabic  10.3.1a 

Income  end  Social  Status  Score , 
Selected  Families 


Source:  Suggested  by  W,  Itloyd  Warner  a 
studies  m eociat  class. 


Income  in 
Thousands 
of  Dollars 
(X) 

Social 

Status 

Score 

(V) 

3 

3 

7 

5 

11 

7 

14 

6 

15 

9 

l 

ii 

o 

i 

F = 6 

Fig  1/ he  10.3.1  Freehand  Trend  Line  on  Scatter  Diagram^ 
Income  and  Social  Status 
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Freehand  Regression  Line.  It  would  of  course  be  possible  to  draw  free- 
hand what  appears  by  inspection  lo  be  the  best-fitting  trend  line.  If  we 
had  drawn  such  a line  and  were  then  asked  to  defend  its  location,  we 
would  spontaneously  reply  that  we  had  drawn  it  “right  through  the 
middle”  — that  is,  as  close  to  all  the  points  on  the  average  as  possible, 
cutting  the  swarm  lengthwise  into  two  equal  bands.  It  was  drawn  down 
the  middle  because  we  intuitively  felt  that  this  line  was  the  best  average 
for  all  of  the  observed  points.  It  thus  serves  as  a running  estimate  of 
the  K-values,  which  takes  into  account  every  observed  value  of  X . If 
we  had  been  asked  to  guess  a person's  social  status  score  from  his  income, 
say  $10,000,  we  would  not  guess  a K- value  outside,  or  eveu  close  to  the 
edge,  of  the  region  of  scatter,  but  rather  a central  value  within  the  swarm 
above  the  designated  A'-value.  This  would  be  a status  score  of  6.  Such 
estimated  values  of  Y correspond  to  the  observed  .Yfs  and  constitute  the 
aforementioned  trend  line.  Naturally,  the  more  closely  tire  observed 
points  hug  the  line  of  estimation  — the  less  they  digress  from  it  — the 
smaller  the  error  in  prediction  of  Y from  X.  If  income  vrere  the  sole 
determinant  of  social  status,  the  observed  points  would  necessarily  all  be 
on  the  line,  without  error;  if  social  status  alone  determined  income,  the 
points  would  again  fall  on  the  line  without  residue. 

Now,  what  is  the  significance  for  correlation  of  these  departures  from 
the  hypothetical  regression  line?  To  the  statistical  analyst  it  indicates 
that  there  are  other  factors  besides  income  which  determine  social  status. 
The  operation  of  these  other,  unknown  factors  disturbs  our  prediction. 
Hence,  they  reduce  the  correlation  between  the  two  given  variables. 

Values  Expressed  as  Deviations  from  Mean.  In  Table  10.3.1a  and  the 
scatter  diagram  derived  from  it,  the  variates  are  given  in  their  original 
form.  But  for  several  reasons,  not  fully  amplified  here,  it  is  more  mean- 
ingful to  compare  the  variables  after  they  have  been  transformed  into 
deviations  from  their  respective  means,  X a ad  V. 

When  comparing  values,  as  we  do  in  correlation,  the  student  must 
accustom  himself  to  thinking  in  terms  of  deviations  from  the  mean  instead 
of  the  raw  data.  Like  so  many  other  statistical  procedures  ' ' ich  may  at 
first  seem  needlessly  indirect  and  complex,  this  tco  conf  i common 

sense.  We  do  not  ordinarily  compare  raw  test  scores,  Mth  rates; 

but  rather  consider  them  as  “high”  or  “low”  — that  ^uve  or  below 
the  average  or  norm  to  which  we  are  accustomed.  Still  less  cm.  we  com- 
pare units  of  different  categories,  such  as  incomes  and  birth  rates  in  their 
raw'  form;  wc  associate  income  below  average  with  family  size  that  ex- 
ceeds the  average.  Thus,  wc  see  that  the  mean  is  the  appropriate  point 
of  origin  — the  natural  standard  — whenever  we  compare  two  sets  of 
data  in  respect  to  their  affinity  with  each  other. 

The  items  in  Table  10.3.1b  arc  therefore  expressed  ns  deviations  from 
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their  respective  means,  and  then  plotted  in  that  form  (Flgur-j  10.3.2u). 
In  graphing  deviations,  we  must  of  course  be  able  to  accommodate  nega- 
tive values,  and  consequently  we  use  four  quadrants,  corresponding  to 


the  four  combinations  of  signs. 

Table  10,3.1b 

Incomes  and  Social  Status  Scores 
as  Deviations  from  K andT 


Income  in 

Social 

Thousands 

Status 

of  Doixabs 

Scorn; 

CO 

C v ) 

-7 

-3 

-3 

-1 

1 

1 

4 

0 

5 

3 

0 

0 

It  should  be  observed  that  the  configuration  of  plotted  deviations  is 
identical  with  the  scatter  of  original  values;  only  the  scale* markers 
have  been  changed.  All  we  did  was  to  shift  the  zero  origin  to  the  inter- 
section of  the  means. 

Explained  Deviations.  The  freehand  trend  line,  drawn  straight  through 
the  middle,  is  obviously  an  average  for*  all  of  the  observed  points,  a con- 
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tinuous  succession  of  norms,  as  it  were,  adjusted  to  the  entire  series  of 
cases.  Since  it  cousists  of  the  deviations  which  one  would  expect  on  the 
average,  the  values  on  the  sloping  trend  line  are  variously  called  the 
“expected,”  “predicted,”  or  “estimated”  values.  The  expected  devia- 
tions are  graphically  plotted  (Figure  10.3.2b)  by  vertical  lines  extending 


Figure  10.3.2b  Explained  Deviations  (y) 


from  the  x-axis  (which  coincides  with  the  mean  of  the  F-valucs)  to  the 
points  on  the  trend  line.  These  are  the  deviations  in  social  status  scores 
(y)  which  one  would  expect,  or  prcdl  if  income  {X)  were  the  oily  vari* 
able  determining  (hem.  We  therefore  state,  in  statistical  language,  that 
these  deviations,  which  may  be  cither  larger  or  smaller  thar  those  ob- 
served, are  explained  by  income,  or  attributable  to  the  income  factor. 
Consequently,  these  deviations  have  conic  to  be  known  as  explained 
deviation*. 

Unexplained  Residuals.  Since  there  are  other  factors  that  also  induence 
social  status  (such  as  occupation,  nativity,  etc.),  the  actual,  or  observed, 
values  do  not  coincide  with  the  expected  values.  The  force  of  these 
unidentified  factors  is  measured  by  the  discrepancies  between  the  observed 
(0)  and  the  expected  (E)  deviations.  The  smaller  these  discrepancies, 
the  weaker  must  be  the  force  of  the  extraneous  factors;  the  larger  those 
discrepancies,  the  stronger  these  forces  must  be.  The  discrepancies  are 
sometimes  called  residuals , and  in  terms  of  the  given  data  remain  im- 
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explained;  that  is,  they  must  be  attributed  not  to  income,  but  to  unknown 
factors  on  the  identity  of  which  we  may  only  speculate,  since  additional 
data  are  not  available  in  the  problem  as  set  up.  Hr  ace,  they  arc  called 
unexplained  residuals,  measured  by  the  respective  distances  from  the 
trend  line  to  the  individual  observed  points  in  the  swarm. 

From  inspection  of  the  foregoing  charts,  we  must  conclude  that  the 
closer  the  correspondence  between  the  explained  and  observed  devia- 
tions (i.e.,  the  smaller  the  resiauats)  the  higher  the  degree  of  correlation. 
It  would  therefore  be  logical  to  measure  the  degree  of  correlation  accord- 
ing to  the  degree  of  correspondence  between  the  estimated  and  observed 
deviations;  or  by  the  proportion  of  the  aggregate  observed  deviation 
that  is  explained.  This  is  the  fundamental  principle  of  the  measurement 
of  correlation  and  is  always  implicit  in  its  calculation.  To  be  sure,  the 
measurement  is  not  carried  out  informally  by  freehand  regression  lines 
or  by  graphic  readings  obtained  with  a ruler  or  a pair  of  dividers;  still, 
it  remains  essentially  a comparison,  or  ratio,  between  the  explained  and 
total  deviations. 

Since  these  concepts  play  such  an  important  £art  in  the  subsequent 
presentation,  it  will  be  weii  to  summarize  their  meaning  precisely: 

(1)  Total  (Observed)  Devi  alien:  the  deviation  of  an  observed  value 
from  the  mean  of  the  series. 

(2)  Explained  Deviation:  the  deviation  of  an  expected  (regression) 
value  from  the  mean  of  the  series. 
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(3)  Unexplained  Residual:  the  discrepancy  between  the  total  and  the 
explained  deviation.  This  is,  cf  course,  the  difference  between 
(1)  and  (2)  above.  . 

The  Measurement  of  Linear  Correlation 

A serious  disadvantage  of  freehand  guiphs  is  that  they  rest  on  personal 
judgment.  Without  a standard  operating  procedure,  it  is  unlikely  that 
two  workers  would  ever  locate  the  trend  line  in  exactly  the  same  position. 
Obviously,  a line  that  is  used  to  measure  correlation  in  the  manner  sug- 
gested above  must  always  meet  (lie  same  specifications;  otherwise  the 
results  will  be  lacking  in  the  reliability  which  is  essentia!  to  scientific 
procedure. 

Such  standard  specifications  have  been  formulated  as  follows:  the 
line  is  so  located  as  to  make  the  sum  of  the  vertical  residuals  around  that 
line  equal  to  zero  — that  is,  to  make  the  sum  of  the  positive  and  negative 
discrepancies  balance  one  another.  Thu?,  such  a hue  represents  (he 
scatter  around  it,  as  the  mean  represents  its  array.  And  like  the  mean, 
it  makes  the  sum  of  the  squared  deviations  a minimum  — it  conforms 
to  the  principle  o f least  squares  * Because  of  these  properties  it  is  labeled 


Figure  10-3.3  Line  of  Least  S^cra  c n SritUr  Diagram, 

Income  end  Sc<ial  <S'otM 

* TV  line  coukl  just  a*  «e'l  be  located  so  as  to  -nmfrwre  the  tufa  of  the  squired  for- 
tof  devotions,  or  r-fefkkab  llo  -tvif,  fiftet  th\i  line  *ouU  yield  the  meif- 
u*eof  corrcUtton  u the  line  ' f'VJSKd  vertical  li  unnecessary 

to  fit  itf  portion.  Oae  resre.^an  line  *i!1  luHce  for  purpoK*  of  toeasurlna  Knear 
correlation  h tvcfu  two  Timblcs. 
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the  lino  of  best  fit;  by  the  criterion  of  least  squares,  it  fits  the  scatter 
better  than  docs  any  other  straight  lire. 

The  tracing  of  this  mathematical  line  of  best  fit  is  simplified  by  again 
plotting  cases  as  deviations  from  means.  For,  under  these  conditions, 
the  line  of  least  squares  will  always  conveniently  pass  through  tho  zero 
origin,  which  lies  at  the  intersection  of  the  means.  Consequently,  it 
may  be  readily  plotted  once  its  slope  on  the  x-axis  has  been  determined. 

Compulation  of  Slope.  The  calculation  of  this  slope  is  according  to  the 
following  equation,  presented  here  without  explanation: 


where  b„  — slope  of  line  of  expected  y’s  on  observed  x’s 
(read:  "b,yonx”) 

2zy  » sum  of  products  of  paired  x-  and  y-dcviations 
(read:  "sum  of  cross-products") 

This  ratio  is  the  slope  of  the  soug’ut-for  best-fitting  line;  it  is  the  aver- 
age change  in  Y per  unit  change  in  X.  Accordingly,  to  6nd  this  average 
change  in  social  status  per  uni*,  change  in  income,  we  compute  the  cross- 
products  and  squared  x-deviatior  , and  form  the  ratio  between  their 
respective  sums.  Performing  these  operations  (Table  10.3.2),  we  obtain 


Table  10.3.2  Calculation  of  b/r 


X 

Y 

/ 

V 

t* 

*v 

a 

3 

~7 

-3 

49 

21 

7 

5 

-3 

-1 

9 

3 

it 

7 

1 

1 

1 

1 

u 

6 

4 

0 

16 

0 

15 

9 

& 

3 

25 

15 

l - 50 

50 

0 

0 

100 

40 

Xty  40 

■ t;  — — » e .4 

Jr’  too 

bf.  * .4.  This  Is  the  average  rate  of  increase  in  social  status  per  unit  of 
income;  that  is,  for  every  $1,000  increase  in  income,  there  is  an  average 
increase  of  .4  status  points.  Because  this  average  fixes  the  inclination 
of  the  regression  line  to  the  axis  of  the  independent  variable,  it  has  conic 
to  be  known  as  elope.  Once  determined,  it  enables  us  to  plot  the  line  of 
least  squares  and  proceed  to  the  pre;i-e  measurement  of  correlation. 
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Plotting  the  Regression  Line ! Since  a n.i^ht  line  is  determined  by  any 
two  points,  we  plot  two  points  kuo.ui  to  lie  on  tic  least-squares  line  and 
run  a straight  line  through  them.  For  one  of  these,  we  pick  up  the  point 
of  origin  (intersection  of  axes);  the  other  point  is  most  conveniently 
obtained  by  calculating  the  expected  value  of  y when  x « 1.00.  This 
will  of  course  be  equal  to  b„.  Accordingly,  we  draw*  a line  through  the 
zero  origin  and  a point  plotted  at  the  intersection  of  1.00  and  .4.  This 
line  is  plotted  in  Figure  10.3.3. 

It  is  this  line  that  supplies  the  explained  deviations  (y*$)  corresponding 
to  the  observed  jps,  since  the  unbroken  line  of  best  fit  necessarily  consists 
of  all  possible  explained  deviations.  From  it,  we  may  read  the  approxi- 
mate magnitudes  of  the  explained  deviations;  but  they  may  bo  more 
precisely  determined  by  application  of  the  slope  formula: 

$ - M*>  00.3.2) 

where  tj  =*  expected  deviation  in  Y for  any  given  z (read  "y-circumflcx”). 

Substituting  the  observed  j-dev intions  in  this  formula,  and  solving  for  the 
y's,  we  obtain  the  results  shown  in  Table  10.3.3.  These  y'a  are  the  y’s 


Table  10.3.3 

Complete  Set  of  y-Valucs  for  Ob- 
served x't 


6fr 

t 

i 

.4 

— 7 

-25 

.4 

-3 

-15 

.4 

1 

.4 

.4 

4 

15 

.4 

5 

2j0 

0 

0.0 

that  would  have  been  observed  if  X and  Y were  perfectly  correlated. 
But  these  were  not  observed;  hcv.cc,  we  mu<t  determine  how  close 
they  come  to  those  observed,  in  order  to  fix  the  degree  of  correlation. 

Coefficient  of  Determination.  Wc  have  thus  come  to  the  final  stage  in  the 
measurement  of  correlation.  This  measurement  proceeds  on  the  principle 
that  the  more  nearly  the  explained  deviations  approach  the  total  devia- 
tions, obviously  the  greater  is  the  proportion  of  variation  explained  and 
the  higher  the  degree  of  correlation.  As  a statistical  operation,  the 
conversion  to  a single  index  consists  of  summing  the  squared  explained 
deviations,  and  expressing  this  explained  vuiction  as  a proportion  of  the 
fofa/  ranafr'on  to  be  explained.  Symbolically, 

j Explained  Variation  ^ Sy* 

* Total  Variation  Sy* 
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r/ 

This  ratio,  be  it  noted,  is  symbolized  by  rJ.  It  is  the  fraction  of  the  total 
variation  in  the  dependent  F-variaMe  which  is  determined  by  the  inde- 
pendent X-variable;  hence,  it  is  called  the  Coefficient,  of  Determination. 
Since  it  is  impossible  to  explain  move  than  the  total  variation,  rs  can  never 
be  greater  than  unity,  and  will  prretiev  lly  always  be  less  than  one. 

Had  the  simple  algebraic  dovhlitmv  nrouud  the  mean  been  employed 
as  measures  of  variation,  they  woidd  have  summed  to  zero  — a very 
impractical  situation  which  would  t:  t have  permitted  any  ratios  at  all. 
Hence  we  turn  to  squared  deviations,  in  compliance  with  the  principle 
of  least  squares. 

But  in  order  to  circumvent  this  more  complicated  computation,  would 
it  not  have  been  possible  to  take  the  simple  arithmetic  deviations  from 
the  median?  This  was,  in  principle,  the  solution  of  Francis  Gatton  who 
originally  propagated  the  concepts  of  regression  and  correlation  in  his 
famous  work  Natural  Inheritance  However,  the  mean  has  prop- 

erties not  possessed  by  the  median  which  make  it  more  useful  in  correla- 
tion computations.  And  once  the  iw\  \w  h*<  been  chosen,  there  is  no 
alternative  but  to  adopt  the  least-squares  principle  for  its  implementa- 
tion. 

The  complete  computation  of  r U illustuted  in  Table  10.3.4,  where 
Table  10.3.4  Computolien  of  r: 


* V 

b * a i'  ‘ v* 

-7  -3 

“3  -1 

1 1 

4 0 

5 3 

(.4)  (~7,  -2.S  7.S4  9 

(.4)  (-3)  — 1.2  1.44  1 

(.4)  (l)  .4  .10  1 

(.4)  (4)  1.6  2.56  0 

(.4)  (5)  2.0  4.00  9 

0.0  16.00  20 

the  explained  and  observed  deviations  ere  squared  and  summed  to  give 
the  explained  and  total  variation,  respectively.  Upon  dividing  the  total 
variation,  20,  into  the  explained  variation,  10,  we  obtain  r*  *»  .SO.  This 
U the  final  measure  of  the  degree  of  association  between  tie  two  vari- 
ables: it  reveals  t'  at  SO  per  cent  of  the  variation  in  Y is  accounted  for 
by  its  linear  dependence  on  \\ 

RrccmhVibj  of  r!.  We  could  have  computed  the  regression  of  X on  Y 
as  entity  a*  the  regression  uf  Y on  X and  thereby  determined  the  pro- 
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portion  of  A'-variation  explained  by  In  fact,  such  a result  would 
appear  essential  to  a complete  statement  of  correlation.  Why  compute 
*»*  and  ignore  r;,?  The  answer  is  that  it  has  not  been  ignored,  although 
we  did  not  compute  it  independently.  It  is  simply  unnecessary  to  fit 
both  regression  lines  for  the  reason  that  r\t  = In  short,  r5  is  revers- 
ible. For  example,  knowing  that  (\, « .SO,  we  may  state  not  only  that 
income  explains  SO  per  cent  of  the  variation  in  social  status  but  (he  re- 
verse as  well,  namely,  that  social  status  accounts  for  SO  per  cent  of  the 
variation  in  income.  Since,  from  a purely  statistical  standpoint,  each 
explains  the  other  to  the  same  degree,  the  subscript  is  usually  not  attached 
to  rK 

Coefficient  of  Non-dcUrmwation-  Since  the  difference  between  total 
variation  and  explained  variation  is  necessarily  equal  to  the  unexplained 
variation,  it  follows  that  the  unexplained  proportion  is  simply  the  differ' 
ence  between  unity  and  r\  a quantity  appropriately  termed  the  Coefficient 
of  Xon-dctcrmihQtion.  Wc  may  write  it  as  follows: 

. _ 5 . _ Kxphined  Variation 

f Total  Variation 

We  could  have  obtained  this  quantity  directly  by  measuring  the  resid- 
uals around  the  regression  line,  equating  and  summing  them,  and  ox* 
pressing  this  sum  as  a proportion  of  total  variation.  In  fact,  this  is  the 
operational  meaning  of  l - r?.  But  whether  calculated  directly  from  the 
residuals,  or  indirectly  via  r3,  this  coefficient  may  always  be  construed 
as  the  proportion  of  variation  n the  dependent  variable  left  linearly 
unexplained  by  the  independent  variable.  It  thereby  gauges  the  strength 
of  the  unidentified  factors. 

Ptanonian  Producl-Mc:ntnt  r.  Conventionally,  it  is  r rather  than  r*  that 
is  selected  and  quoted  to  indicate  the  degree  of  correlation  between  the 
two  variables.  Why  is  this  so,  particularly  since  r • scoiu3  to  have  all  of 
the  prerequisites  for  a satisfactory  index  of  correlation:  it  ranges  between 
tero  and  unity;  it  has  a simple,  comprehensible  meaning?  Nevertheless, 
unless  another  index  is  specifically  named,  it  is  assumed  that  the  correla- 
tion coefficient  refers  to  Peawnian  r.  How  is  this  convention  explained? 

Before  formulating  an  answer  to  that  question,  we  must  first  acquaint 
ourselves  with  the  distinctive  propeuies  of  r.  Accordingly',  we  return 
once  again  to  the  scatter  diagram  and  the  line  of  best  fit.  Our  procedure 
trill  be  exactly  the  same  as  that  followed  previously,  excepting  that  the 
data  will  now  be  ptoltcd  as  stand  ud  deviates  (<r-vnlues),  instead  of  the 
observed  deviations  (Figure  10.3.1).  Again  the  configuration  of  points 
is  left  unaffected  by  the  transformation:  the  sole  alteration  U that  the 
scale  unit  is  now  the  standard  thuaticii  rather  than  the  original  raw 
unit  of  measure, 
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Fiovrk  10.3.4  ScclUr  Diajnm  of  Standard  Dtrialtt 


Table  10.3.5 

Deviation*  at  Standard 
Deviate* 


To  establish  the  lino  of  best  fit  through  this  scal'.er  of  ^-points,  we 
first  calculate  its  slope  as  before,  except  that  now  we  operate  on  the 
standard  deviates  instead  of  the  mw  deviations.  Consequently,  the  slope 
would  now  bo  written  in  standard  form: 
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instead  of: 

2zy 

X 

Since  - is  usually  symbolized 
os: 


and  symbolized  r,t  instead  of  b,t.  This  is  the  culmination  of  our  develop- 
ment. Wc  arc  now  in  a position  to  see  that  the  slope  of  the  regression 
line  through  the  scatter  of  c-points  is  identical  with  the  square  root  of 
explained  variation:  that  is,  r =»  V?.  For  the  example  above  (Table 
1C 3.4),  v^80-  ±.S9. 

To  compute  r directly,  we  would  proceed  as  in  Table  10.3.6,  where 


the  slope  formula  may  be  simply  written 


' ** 


iV 


(10.3.4) 


Table  10.5.6 

Pcarsonian  r as  ^fca>l  oj 
Cross-Products  of  Stand- 
ard Deviates 


** 

h 

— 1 .50 

-1.50 

2.310 

- .07 

- .60 

.333 

.22 

.50 

.110 

.SO 

.00 

.000 

1.12 

1.50 

1.6S0 

4.465 

r ** 

Zt.t,  4.40 
.V  “ 5 

S9 

paired  standard  deviates  are  multiplied  together  and  those  cross-products 
are  summed  and  averaged.  The  mean  of  tire  cress-products  is  t. 

Wo  can  now  understand  why  r was  designated  by  Karl  Pearson  "prod- 
uct-moment correlation  coefficient.”  From  the  composition  of  Formula 
10.3.4  above,  it  is  dear  that  r is  strictly  an  arithmetic  mean;  it  is  a sum 
divided  by  the  number  of  items  in  tint  sum.  Now,  the  terms  "mean" 
and  "moment”  ore  used  interclrangcably  in  mathematical  statistics; 
hence,  Pearson  labeled  the  mean  of  the  standard  cross-products  the 
product-moment  correlation  coefficient. 

Viewed  in  this  light,  r may  be  construed  as  the  mean  change  in  Y for 
every  unit  change  in  A’,  or  the  reverse,  always  assuming  measures  in 
standard  form,  l.o.,  expressed  in  terms  of  signia  unit*.  Thus*  if  a given 
income  deviates  by  Iff  from  its  mean,  wc  expect  the  associated  status 
scores  to  deviate  on  the  average  from  (heir  mean  by  ,SP<r.  Bjr  virtue  of 
this  principle,  we  may  estimate  Y (in  standard  form)  for  any  given 
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value  of  .V,  exactly  a 3 we  applied  bt,  to  obtain  the  expected  y-deviations. 
We  have  only  to  apply  the  formula: 

i , “ r(z-)  (10.3.5) 

where  i,  **  estimated  cr-vahie  in  Y 
•»  observed  a- value  in  X 

Substituting  our  data  in  this  formula,  wc  get  the  estimated  values  shown 
in  Table  10.3.7.  Thus,  when  income  is  1.12  in  standard  form,  social 


Table  10.3<7  Estimated  r,  from  Observed  r* 


u 

<r)*i 

K3 

4 

-1.5C 

tS9)(-l.5G) 

KS 

-1.39 

- .67 

(.$!>)(-  .67) 

*r 

- .60 

22 

(■SOX 

.22) 

<3 

.20 

.89 

me 

.SO) 

r* 

.79 

1.12 

(m  i.i2) 

w 

1.00 

status  Is  estimated  to  be  1.00;  when  income  is  — 1.56<r,  social  status  would 
be  — 1.39<r.  Perfect  correlation  between  two  scries  would  prevail  whenever 
the  paired  values  are  identical  sigma  distances  from  their  respective 
means  — for  example,  if  the  person  with  a sociology  grade  of,  say,  l.Sff 
above  the  mean  were  located  l.Sff  above  the  mean  of  the  history  grades, 
end  ell  other  observations  in  the  tv.*o  series  wete  also  perfectly  paired. 
Because  r is  reversible,  we  could  have  applied  r to  the  observed  *,’s  and 
thereby  estimated  the  corresponding  i,V 

Comparison  of  r and  r*.  On  the  surface,  the  difference  between  r and  r* 
appears  very  trivial:  a simple  detail  of  exponent,  with  both  values  easily 
convertible  Into  the  other,  after  one  has  been  computed.  Nevertheless, 
this  seemingly  innocent  difference  cannot  be  so  lightly  dismissed,  for  the 
two  r*a  respectively  focus  on  two  distinct  but  interrelated  aspects  of 
covariation  — a distinction  which  the  design  of  the  present  chapter  has 
deliberately  sought  to  portray. 

Either  one  could  have  been  mechanically  derived  from  the  other  by 
way  of  only  one  of  the  two  procedures.  However,  an  r*  derived  from  r 
would  never  convey  to  the  student  the  intent  and  meaning  of  explained 
variation  which  r*  measures.  The  "square  of  the  slope”  would  mean 
nothing  to  him.  Nor  would  an  r,  converted  arithmetically  from  r!,  con- 
vey the  intent  and  meaning  of  the  slope.  The  "square  root  of  explained 
variation"  could  not  be  visualised  slope.  Although  r and  r*  arc  inter- 
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dependent,  t*  measures  the  over-nil  proportion  of  the  total  variation  of 
one  variable  that  is  associated  with,  or  explained  hy,  tho  other. 

On  the  other  hand,  r measures  the  dynamic  aspect  of  this  relation, 
measuring  the  rate  of  change  in  one  variable  relative  to  the  other,  as 
has  been  demonstrated  in  the  pirad  ignis  above.  Because  of  this  con- 
ceptual distinction,  we  may  say  that  r is  primarily  a predictive  device 
to  forecast,  for  example,  the  expected  level  of  performance  on  one  vari- 
able from  observed  performance  on  another.  As  such,  it  is  probably 
more  likely  to  be  used  by  educators,  psychologists,  and  others  interested 
ill  personal  prediction,  rather  thin  by  sociologists.  On  the  other  hand, 
r*  is  a summariiing  measure  weighing  the  influence,  or  force,  exerted  by 
one  variable  on  the  other. 

Since  r is  slope,  it  manifestly  must  have  a direction:  predominantly 
up  or  down,  according  to  whether  the  variables  are  positively  or  nega- 
tively related.  It  follows  that  the  direction  of  the  slope  reflects  the  type 
of  relation,  which  is  then  symbol!  Jtd  by  a plus  or  minus  sign.  The  synop- 
tic measure  r*,  however,  carries  no  sign  since  it  expresses  a proportion 
of  a total  variation. 

It  is  now  clear  that  r and  r*  are  not  interchangeable;  nor  should  they 
be  cheaply  derived  from  each  other  until  their  structural  meanings  are 
thoroughly  understood.  Since  r is  always  larger  than  r’,  it  could  be 
deliberately  or  unwittingly  used  to  exaggerate  the  strength  of  associa- 
tion and  thereby  mislead  the  reader:  for  example,  r *»  .5  may  seem  to 
signify  a reasonably  strong  association,  but  r’  **  .25  indicates  that  only 
25  per  cent  of  the  variation  in  either  variable  is  accounted  for  by  the 
other.  When  the  emphasis  is  on  the  strength  of  the  over-all  relationship 
between  two  variables,  os  is  frequently  the  case  in  sociological  studies, 
r*  is  the  pertinent  statistic.  For  example,  the  finding  that  50  per  cent 
of  the  variation  in  orea  delinquency  may  be  attributed  to  the  economic 
factor  still  leaves  something  unexplained;  yet  it  represents  a tangible 
gain  in  understanding  a phenomenon  that  until  a century  ago  was  laid 
at  the  door  of  demons,  heredity,  cxr  free  will. 

In  sum,  each  measure  has  its  very  distinctive  connotations  and  ap- 
propriate uses.  Nevertheless,  r enjoys  a near  monopoly  over  r*.  This 
may  bo  due  partly  to  the  inertia  of  tradition,  which  will  perhaps  be  dis- 
sipated when  workers  become  mere  sensitive  to  the  nuances  of  quantita- 
tive reasoning. 

Computing  Formula*:  Ungroup  f Da fo.  The  extended  analytic  opera- 
tions previously  given  were  tlc-’^ucd  to  expose  the  logic  and  intent  of 
the  concepts  of  r and  r*,  so  that  they  could  be  discriminatingly  applied 
in  social  research.  But  such  elaborate  calculations  need  not  be  carried 
out  if  the  sole  object  is  to  determine  the  mere  numerical  ^otue  of  r or 
r*  in  a given  practical  problem.  There  are  l.andliook  working  procedures 
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that  obviate  the  need  to  compute  slope,  the  explained  deviation?,  the 
unexplained  residuals,  and  other  quantities  involved  in  the  formulas 
given  above.  Of  course,  such  mechanical  computing  methods  arc  not 
interpretive  tools:  they  arq  merely  recipes  that  can  be  followed  by  any 
intelligent  clerk  or  machine. 

The  computing  programs  here  given  begin  with  the  product-moment 
formula,  which  may  be  rewritten  for  somewhat  simpler  calculation: 


Xry 


(10.3.G) 


And  it  may  be  still  more  conveniently  rewritten,  particularly  when  a 
desk  calculator  is  available,  in  terms  of  the  original,  untransformed  values: 

r - (10.3.7) 

VEFtSX*  - (2A)OOVSP-  (21')*] 

This  is  the  computing  formula  for  r,  ungrouped  data.  It  intends  to 
eliminate  many  of  the  vexatious  arithmetic  details  that  ere  sure  to  arise 
when  the  operation  i3  carried  out  on  the  raw  deviations  around  means. 

This  formula,  be  it  noted,  requires  only  five  sums  and  correspondingly 
five  work  columns.  Two  of  these  columns  comprise  tho  original  tabula- 
tion of  .V  and  Y,  so  that  only  three  more  are  needed  in  order  to  proceed 
with  the  calculation  of  r.  Substituting  sums  in  the  formula,  we  again 
obtain  r .89. 
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Worktable  for  Calcuta- 
Table  10.3.8  lion  of  rfrom  Ur, grouped 
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Y 
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5 

49 

25 

35 

11 

7 
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49 

77 

14 

6 
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36 

$4 

15 

9 
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31 
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2 * SO 

30 
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CAUSATION  AND  THE  LAW 


One  of  the  most  Important  elements  in  the  attribution  of  legal 
responsibility  for  negligence  in  cases  involving  torts  is  the  causal 
relation  between  defendant's  fault  and  plaintiff's  injury.  In  the  law, 
however,  as  Harper  and  James  point  out  in  the  first  of  the  following 
two  readings,  there  has  developed  the  concept  of  "proximate"  or 
"legal"  cause,  which  differs  from  the  scientific  meaning  of  cause 
In  that  it  limits  and  restricts  liability.  The  result,  they  note,  has 
been  a widely  recognized  confusion. 

It  would  be  insufficient,  however,  merely  to  state  that  proxi- 
mate cause  is  a legal  device  to  restrict  liability;  the  student  should 
also  note  the  fundamental  difference  between  the  ways  in  which  the 
lawyer  and  scientist  approach  causation.  The  selection  from  Hart 
and  Honore  discusses  this  difference.  According  to  the  authors, 
the  kinds  of  inquiries  made  by  scientists  are  explanatory  while  thore 
of  the  lawyer  are  attributive.  The  scientist's  concern  is  to  isolate, 
as  far  as  possible,  all  antecedents  of  a given  consequence,  for 
example,  in  the  construction  of  experiments.  The  lawyer,  on  the 
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other  hand,  pursues  a much  less  ambitious  goal,  focusing  on  whether 
a particular  defendant's  conduct  was  culpable.  In  other  words,  there 
may  well  be  multiple  causes  of  an  event,  and  It  may  be  the  task  of 
science  to  disentangle  this  complex  network;  but  for  the  purposes 
of  the  law,  it  is  sufficient  to  establish  the  nature  of  the  defendant's 
contribution. 


From  H,  L.A.  Hart  and  A. M.  Honore,  Causation  iji  the 
Law  (Oxford:  The  Clarendon  Press,  1959)  pp.  26-30; 
64-78.  Reprinted  by  permission. 

From  Fowler  V.  Harper  and  Fleming  James,  Jr. , The  Law 
of  Torts  (Boston:  Little,  Brown,  & Co. , Inc.,  1956) 
pp.  1108-1121.  Reprinted  by  permission. 
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H.L.A.  Hart  and  A.M.  Honore,  Causation  in  the  Law 
(Oxford:  Oxford  University  Press,  195977  pp.  64~78. 


‘To  consequences  no  limit  can  be  s;t*:  'Every  event  which  would 
not  have  happened  if  an  earlier  event  hid  not  happened  is  the  con- 
sequence of  that  earlier  event.'  These  two  propositions  are  not  equiva- 
lent in  meaning  and  arc  not  equally  or  in  the  same  way  at  variance 
with  ordinary  thought.  They  have,  however,  both  been  urged  some- 
times in  the  same  breath  by  the  legal  theorist1  and  the  philosopher: 
they  a re  indeed  sometimes  said  by  lawyers  to  be  'the  philosophical 
doctrine*  of  causation.  It  is  peihaps  noi  difficult  even  for  the  layman 
to  accept  the  first  proposition  as  a truth  about  certain  physical  events; 
an  explosion  may  cause  a flash  of  light  which  will  be  propagated  as 
far  as  the  outer  nebulae;  its  effects  or  consequences  continue  in- 
definitely. It  is,  however,  a different  matter  to  accept  the  view  that 
whenever  a man  is  murdered  with  a gun  his  death  was  the  consequence 
of  (still  less  an  ‘effect  ’ of  cr  ‘ caused  by’)  the  manufacture  of  the  bullet. 
The  first  tells  a perhaps  unfamiliar  tale  about  unfamiliar  events;  the 
second  introduces  an  unfamiliar,  though,  of  course,  a possible  w’ay  of 
speaking  about  familiar  events.  It  is  not  that  this  unrestricted  use  of 
‘consequence'  is  unintelligible  or  never  found;  it  is  indeed  used  to  refer 
to  bizarre  or  fortuitous  connexions  or  coincidences:  but  the  point  is 
that  the  various  causal  notions  employed  for  the  purposes  of  explana- 
tion, attribution  of  responsibility  or  the  assessment  of  contributions 
to  the  course  of  history  carry  with  them  implicit  limits  which  are 
similar  in  these  different  employments. 

It  is,  then,  the  second  proposition,  defining  consequence  in  terms 
of ‘necessary  condition’,  with  which  theorists  are  really  concerned. 
This  proposition  is  the  corollary  of  the  view  that,  if  we  look  into  the 
past  of  any  given  event,  there  is  an  infinite  number  of  events,  each  of 
which  is  a necessary  condition  of  the  given  event  and  so,  as  much  as 
any  other,  is  its  cause.  This  is  the  'cone’2of  causation,  so-called  because, 
since  any  event  has  a number  of  simultaneous  conditions,  the  series 
fans  out  as  we  go  back  in  time.  The  justification,  indeed  only  partial, 
for  calling  this  ‘the  philosophical  doctrine*  of  causation  is  that  it  re- 
sembles Mill’s  doctrine  that 1 we  have  no  right  to  give  the  name  of  cause 
tooneoftheconditionsexclusiveof  theothers  of  them’.  ItdifTers  from 

1 Lawson,  Negligence  in  the  Civil  La w,  p.  33. 

* Glanvillc  SViUiams,  Joint  Tort:  ar.d  Contributory  Negligence,  p.  239. 
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Mill's  view  in  taking  the  essence  of  causation  to  be  ‘necessary  con- 
dition’ and  not  * the  sum  total'1  of  the  suTtcient  conditions  of  an  event. 
Legal  theorists  have  developed  this  account  of  cause  and  con- 
sequence to  show  what  is  'factual’,  ‘objective*,  or  ‘scientific*  in 
these  notions*  this  they  call  ‘cause  i , fact’  and  it  is  usually  stressed  as 
a preliminary  to  the  doctrine  that  any  more  restricted  application  of 
these  terms  in  the  law  represents  nothing  in  the  facts  or  in  the  meaning 
of  causation,  but  expresses  fluctuating  legal  policy  or  sentiments  of 
what  is  just  or  convenient.  Moral  philosophers  have  insisted  in  some- 
what similar  terms  that  the  consequences  of  human  action  are 
‘infinite’;  this  they  have  urged  as  an  objection  against  the  Utilitarian 
doctrine  that  the  rightness  of  a morally  right  action  depends  on 
whether  its  consequences  are  better  than  those  of  any  alternative 
action  in  the  circurnsta  nces.  ‘ We  should  have  to  trace  as  far  as  possible 
the  consequences  not  only  for  the  persons  affected  directly  but  also 
for  those  indirectly  affected  and  to  these  no  limit  can  be  set/2  Hence, 
so  the  argument  runs,  v/c  cannot  either  inductively  establish  the 
Utilitarian  doctrine  that  right  acts  are‘optimific’  or  use  it  in  particular 
cases  to  discover  what  is  right.  Yet,  however  vulnerable  at  other 
points  Utilitarianism  may  be  as  an  account  of  moral  judgment,  this 
objection  seems  to  rest  on  a mistake  as  to  the  sense  of  ‘consequence’. 
The  Utilitarian  assertion  that  the  rightness  of  an  action  depends  oa 
its  consequences  is  not  the  same  as  the  assertion  that  it  depends  on 
all  those  later  occurrences  which  svould  not  have  happened  had  the 
action  not  been  done,  to  which  indeed  ‘no  limit  can  be  set\  It  is 
important  to  see  that  the  issue  here  is  no;  the  linguistic  one  whether 
the  word  ‘consequence'  would  be  understood  if  used  in  this  way.  The 
point  is  that,  though  we  could,  we  Jo  not  think  in  this  way  in  tracing 
connexions  between  human  actions  and  events.  Instead,  whenever 
we  are  concerned  with  such  connexions,  whether  for  the  purpose  of 
explaining  a puzzling  occurrence,  assessing  responsibility,  or  giving 
an  intelligible  historic?!  narrative,  we  employ  a set  of  concepts  restrict- 
ing in  various  ways  what  counts  as  a consequence.  These  restrictions 
colour  all  our  thinking  in  causal  terms;  when  we  find  them  in  the  law 
we  are  not  finding  something  invented  by  or  peculiar  to  the  law, 
though  of  course  it  is  for  the  law  to  say  when  and  how  far  it  will  use 
them  and,  where  they  are  vague,  to  supplement  them. 

No  short  account  can  be  given  of  the  limits  thus  placed  on  ‘con- 
sequences* because  these  limits  vary,  intelligibly,  with  the  variety  of 
causal  connexion  asserted.  Thus  we  may  be  tempted  by  the  generaliza- 
tion that  consequences  must  always  be  something  intended  or  fore- 
seen or  at  least  foreseeable  with  ordinary  care:  but  counter-examples 
spring  up  from  many  types  of  context  where  causal  statements  are 
* Milt,  Eook  III,  chap,  v,  s,  2,  * Ross,  The  Right  aj  the  Good,  p.  36. 
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made.  If  smoking  is  shown  to  cause  lung  cancer  this  discovery  will 
permit  us  to  describe  past  as  well  as  future  cases  of  cancer  as  the 
effect  or  consequence  of  smoking  even  though  no  one  foresaw  or  had 
reasonable  grounds  to  suspect  this  in  the  past.  What  is  common  and 
commonly  appreciated  and  hence  foreseeable  certainly  controls  the 
scope  of  consequences  in  certain  varieties  of  causal  statement  but  not 
in  all.  Again  the  voluntary  intervention  of  a second  person  very  often 
constitutes  the  limit.  If  a guest  sits  down  at  a table  laid  with  knife 
and  fork  and  piunges  the  knife  into  his  hostess's  oreast,  her  death  is 
not  in  any  context  thought  of  as  caused  by,  or  the  effect  or  result  of 
the  waiter's  action  in  laying  the  table;  nor  would  it  be  linked  with  this 
action  as  its  consequence  for  any  of  the  purposes,  explanatory  or 
attributive,  for  which  we  employ  causal  notions.  Yet  as  we  have  seen 
there  arc  many  other  types  of  case  where  a voluntary  action  or  the 
harm  it  does  are  naturally  attributed  to  some  prior  neglect  of  pre- 
caution as  its  consequence.  Finally,  we  may  think  that  a simple 
answer  is  already  supplied  by  Hume  and  Mill's  doctrine  that  causal 
connexion  rests  on  general  laws  asserting  regularconnexion;yet,even 
in  the  type  of  case  to  which  this  important  doctrine  applies,  reference 
to  it  alone  will  not  solve  our  problem.  For  we  often  trace  a causal 
connexion  between  an  antecedent  and  a consequent  which  themselves 
very  rarely  go  together:  wc  do  this  when  the  case  can  be  broken 
down  into  intermediate  stages,  which  themselves  exemplify  different 
generalizations,  as  when  we  find  tuat  the  fall  of  a tile  was  the  cause  of 
someone's  death,  rare  though  this  be.  Here  our  problem  reappears  in 
the  form  of  the  question:  When  can  generalizations  be  combined  in 
this  way? 

We  shall  examine  first  the  central  type  of  case  where  the  problem 
is  of  this  last-mentioned  form.  Here  the  gist  of  the  causal  connexion 
lies  in  the  general  connexion  with  each  other  of  the  successive  stages; 
and  is  not  dependent  on  the  special  notions  of  one  person  providing 
another  with  reasons  or  exceptional  opportunities  for  actions.  This 
form  of  causal  connexion  may  exist  between  actions  and  events,  ana 
between  purely  physical  events,  and  it  is  in  such  cases  that  the  words 
‘cause1  and  ‘causing’  used  of  the  antecedent  action  or  event  have 
their  most  obvious  application.  It  is  convenient  to  refer  to  cases  of 
the  first  type  where  the  consequence  is  harm  as  cases  of  ‘causing 
harm*,  and  to  refer  to  cases  where  harm  is  the  consequence  of  one 
person  providing  another  with  reasors  or  opportunities  for  doing 
harm  as  cases  of  ‘inducing',  ‘advising*,  or  ‘occasioning’  harmful 
acts.1  In  cases  of  the  first  type  a voluntary  act,  or  a conjunction  of 
events  amounting  to  a coincidence,  operates  as  a limit  in  the  sense  that 
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events  subsequent  to  these  are  not  attributed  to  the-  antecedent  action 
or  event  as  its  consequence  even  though  they  would  not  have  happened 
without  it.  Often  such  a limiting  action  or  coincidence  is  thought  of 
and  described  as  ‘intervening’:  and  lawyers  speak  of  them  as  ‘super- 
seding’ or  ‘extraneous’  causes  ‘breaking  the  chain  of  causation’. 
To  sec  what  these  metaphors  rest  on  (and  in  part  obscure)  and  how 
such  factors  operate  as  a limit  we  sha'l  consider  the  detail  of  three 

simple  cases.  .... 

(t)  A forest  fire  breaks  out,  and  later  investigation  shows  that 
shortly  before  the  outbreak  A had  flung  away  a lighted  cigarette  into 
the  bracken  at  the  edge  of  the  forest,  the  bracken  caught  fire,  a tight 
breeze  got  up,  and  fanned  fhe  flumes  in  thedirectionof  the  forest.  If, 
on  discovering  these  facts,  we  hesitate  before  saying  that  A's  action 
caused  the  forest  fire  this  would  be  to  consider  the  alternative  hypo- 
thesis that  in  spite  of  appearances  the  fire  only  succeeded  A’s  action  in 
point  of  time,  that  the  bracken  flickered  out  harmlessly  and  the  forest 
fire  was  caused  by  soinething  else.  To  dispose  of  this  it  may  be  neces- 
sary to  examine  in  further  detail  the  process  of  events  between  the 
ignition  of  the  bracken  and  the  outbreak  of  fire  in  the  forest  and  ;o 
show  that  these  exemplified  certain  types  of  continuous  change.  If 
this  is  shown,  there  is  no  longer  any  room  for  doubt:  A’s  action  wo* 
the  cause  of  the  fire,  whether  he  intended  it  or  not.  This  seems  and  is 
the  si-.tplcst  ofcases.  Yet  it  is  important  to  notice  that  even  in  applying 
our  general  knowledge  to  a case  as  simple  as  this,  indeed  in  regarding 
it  as  simple,  we  make  an  implicit  use  of  a distinction  between  types 
of  factor  which  constitute  a limit  in  tracing  consequences  and  those 
which  we  regard  as  mere  circumstances  ‘through’  which  we  trace 
them.  For  the  breeze  which  sprang  up  after  A dropped  the  cigarette, 
and  without  which  the  fire  wouldjiot  have  spread  to  trie  forest,  was 
not  only  subsequent  to  his  ^action  but  entirely  independent  of  it^it- 
was,  however,  a common  recurrent  teature"of  there nvironment,  and, 
as  such,  it  is  thought  of  not  as  an  ‘intervening’  force  but  as  merely 
part  of  the  circumstances  in  which  the  cause  ‘ operates’.  The  decision 
so  to  regard  it  is  implicitly  taken  when  we  combine  our  knowledge  of 
the  successive  stages  of  the  process  and  assert  the  connexion. 

It  is  easy  here  to  be  misled  by  the  natural  metaphor  of  a causal 
‘chain’,  which  may  lead  us  to  think  that  the  causal  process  consists  of  a 
series  of  single  events  each  of  which  is  dependent  upon^ (would  not 
have  occurred  without)  its  predecessor  in  the  ‘chain’  and  so  is 
dependent  upon  the  initiating  action  or  event.  In  truth  in  any  causal 
process  we  have  at  each  phase  not  single  events  but  complex  sets  of 
conditions,  and  among  these  conditions  are  some  which  are  not  only 
subsequentto.butindependentof  the  initiating  action  or  event.  Some 
of  these  independent  conditions  such  ns  the  evening  breeze  in  the 
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example  chosen,  \sc  classify  i*s  mere  conditions  in  or  on  which  the 
cause  operates;  others  we  speak  of  as  ‘interventions*  or  ‘causes’.  To 
decide  how  such  independent  elements  shall  be  classified  is  also  to 
decide  how  we  shall  combine  our  knowledge  of  the  different  general 
connexions  which  the  successive  stages  exemplify,  and  it  is  important 
to  see  that  nothing  in  this  knowledge  itself  can  resolve  this  point.  We 
may  have  to  go  to  science  for  the  relevant  general  knowledge  before 
we  can  asser^with  proper  confidence  that  A's  action  did  cause  the 
fire,  but  science,  though  it  tells  us  that  an  air  current  was  required,  is 
silent  on  the  difference  between  a current  in  the  form  of  an  evening 
breeze  and  one  produced  by  someone  who  deliberately  fanned  the 
flames  as  they  were  flickering  out  in  the  bracken.  Yet  an  air  current 
in  thisjformjs  not  a ‘condition*  or 1 mere  circumstance' through  which 
we  can  trace  the ’consequence;  its' presence  Ts^uld  force us’  tore  vise 
the  &$se7ffo7u  that  A caused  the  fire."  ConverieTy~if  seferief  helped 
us  to  identify  as  a necessary  factorin  producing  the*  fire  some  con* 
dition  or  element"  of  which"  we  had  previou  sly'  been  "totally  ignorant, 
e,g.  the  persistence  of  oxygen,  this  would  leave  our  original  judgment 
undisturbed  if  this  factor  were  a common  or  pervasive  feature  of  the 
environment  or  of  the  thing  in  question.  There  is  thus  indeed  an  im- 
portant sense  in  which  it  is  true  that  the  distinctions  between  cause 
and  conditions  is  not  a ‘scientific’  one.  It  is  not  determined  by  laws 
or  generalizations  concerning  connexions  between  events. 

When  we  have  assembled  all  our  knowledge  of  the  factors  involved 
i,.  the  fire,  the  residual  question  which  we  then  confront  {the  attributive 
question)  rnay  be  typified  as  follows:  Here  is  A’s  action,  here  is  the 
fire:  can  the  fire  be  attributed  to  .Vs  acticn  as  its  consequence  given 
that  there  is  also  this  third  factor  (the  breeze  or  B‘s  intervention) 
without  vvhich  the  fire  would  not  have  happened?  It  is  plain  that,  both 
in  raising  questions  of  this  kind  and  in  answering  them,  ordinary 
thought  is  powerfully  influenced  by  the  analogy  between  the  straight- 
forward cases  of  causal  attribution  (where  the  elements  required  for 
the  production  of  harm  in  addition  to  the  initiating  action  are  all 
‘normal’  conditions)  and  even  simpler  cases  of  responsibility  which 
we  do  not  ordinarily  describe  in  causal  language  at  all  but  by  the 
simple  transitive  verbs  of  action.  These  are  the  cases  of  the  direct 
manipulation  of  objects  involving  changes  in  them  or  their  position: 
cases  where  we  say  ‘He  pushed  it’,  ‘He  broke  it’,  ‘He  bent  it.’  The 
cases  which  we  do  confidently  describe  in  causal  language  (‘The  fire 
was  caused  by  his  carelessness’, ‘He  caused  a fire’)  are  cases  where  no 
other  human  action  or  abnormal  occurrence  is  required  for  the  pro- 
duction of  the  effect,  but  only  normal  conditions.  Such  cases  appear 
as  mere  long  range  or  less  direct  versions  or  extensions  of  the  most 
obvious  and  fundamental  case  of  all  for  the  attribution  of  responsi* 


H.L.A.  Hart  and  A.M.  Honore 


biKty:  the  case  where  we  can  simply  say  *He  did  it/  Conversely  in 
attaching  importance  to  thus  causing  harm  as  a distinct  ground  of 
responsibility  and  in  taking  certain  kinds  of  factor  (whether  human 
interventions  or  abnormal  occurrences),  without  which  the  initiating 
action  would  not  have  led  to  harm,  to  preclude  the  description  of  the 
case  in  simple  causal  terms,  common  sense  is  affected  by  the  fact  that 
here,  because  of  the  manner  in  which  the  harm  eventuates,  the  out- 
come cannot  be  represented  as  a mere  extension  of  the  initiating 
action;  the  analogy  with  the  fundamental  case  for  responsibility 
(‘He  did  it'j'has  broken  down. 

When  we  understand  the  power  exerted  over  our  ordinary  thought 
by  the  conception  thatcausing  harm  is  a mere  extension  of  the  primary 
case  of  doing  harm,  the  inter-related  metaphors  which  seem  natural  to 
lawyers  and  laymen,  in  describing  various  a$pe:ts  ofcausat  connexion, 
fall  into  place  and  we  can  discuss  their  factual  basi^.  The  persistent 
notion  that  some  kinds  of  event  required  in  addition  to  the  initia- 
ting action  for  the  production  of  harm  ‘break  the  chain  of  causation’ 
is  intelligible,  if  we  remember  that  though  such  events  actually  com- 
plete the  explanation  of  the  harm  (and  so  make  rather  than  break  the 
causal  explanation)  they  do,  unlike  mere  normal  conditions,  break  the 
analogy  with  case*  of  simple  actions.  The  same  analogy  accounts  for 
the  description  of  these  factors  as  ‘new  actions’  {novas  actus)  or 
‘new causes’,  ‘superseding",  ‘extraneous’,  ‘intervening  forces’:  and 
for  the  description  of  the  initiating  action  w hen  ‘ the  chain  of  causa- 
tion’ is  broken  as  ‘no  longer  operative’,  ‘having  worn  out*,  functus 
officio.1  So  too  when  the  ‘chain*  is  held  not  to  be  ‘broken’  the  initia- 
ting action  is  said  to  be  still  ‘potent*,-  ‘continuing’,  ‘contributing’, 
‘operative’,  and  the  mere  conditions  held  insufficient  to  break  the 
chain  are  ‘part  of  the  background*,*  ‘circumstances  in  which  the 
cause  operates’,4  ‘the  stage  set’,  ‘part  of  the  history’. 

(ii)  A throws  a lighted  cigarette  into  the  bracken  which  catches 
fire.  B,  just  as  the  flames  are  about  to  flicker  out,  deliberately  pours 
petrol  on  them.  The  fire  spreads  and  burns  down  the  forest.  A’s  action, 
whether  or  not  he  intended  the  forest  fire,  was  not  the  cause  of  the 
fire:  B’s  was. 

The  voluntary  intervention  of  a second  human  agent,  as  in  this 
case,  is  a paradigm  among  those  factors  which  preclude  the  assimi- 
lation in  causal  judgments  of  the  first  agent’s  connexion  writh  the 
eventual  harm  to  the  case  of  simple  direct  manipulation.  Such  an 
intervention  displaces  the  prior  action's  title  to  be  called  the  cause  and, 

1 Davits  v.  Swan  Motor  Co.,  (1947}  2 KB.  29),  313. 

» Minister  of  Pensions  v.  Cwnr.rH,  11947}  K.B.  750,  256.  Lord  Wright  (1950),  15 
Mod.  L.R.  3. 

1 Norris  v.  William  Moss  4 Son  Ltd (1954)  1 W.L.R.  46,  351. 

4 Minister  of  Pensions  v,  ChsnneU,  {1947J  K.3.  250,  256. 
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in  the  persistent  metaphors  found  in  the  law,  it  ‘reduces*  the  earlier 
action  and  its  immediate  etTects  to  the  level  of ‘mere  circumstances’ 
or  ’part  of  the  history*’  B in  this  case  was  not  an  ‘instrument’ 
through  which  A worked  or  a victim  of  the  circumstances  A has 
created,  He  has,  on  the  contrary,  freely  exploited  the  circumstances 
and  brought  about  the  fire  without  the  co-operation  of  any  further 
agent  or  any  chance  coincidence.  Compared  with  this  the  claim  of 
A’s  action  to  be  ranked  the  cause  of  the  fire  fails.  That  this  and  not  ;he 
moral  appraisal  of  the  two  actions  is  the  point  of  comparison  seems 
clear.  If  A and  B both  intended  to  set  the  forest  on  fire,  and  this 
destruction  is  accepted  as  something  wrong  or  wicked,  their  moral 
wickedness,  judged  "by  the  criterion,  of  intention,  is  the  same.  Yet  the 
causa!  judgment  differentiates  between  them.  If  their  moral  guilt  is 
judged  by  the  outcome,  this  judgment  though  it  would  differentiate 
between  them  cannot  be  the  source  of  the  causal  judgment;  for  it 
presupposes  it.  The  difference  just  is  that  B has  caused  the  harm  and 
A has  not,  Again,  if  we  appraise  these  actions  as  good  or  bad  from 
different  points  of  view,  this  leaves  the  causal  judgments  unchanged. 
A may  be  a soldier  of  one  side  anxious  to  burn  down  the  enemy's 
hide-out:  B may  be  an  enemy  soldier  who  has  decided  that  his  side 
is  too  iniquitous  to  defend.  Whatever  is  the  moral  judgment  passed 
on  these  actions  by  different  speakers  it  would  remain  true  that  A had 
not  caused  the  fire  and  B had. 

Th  ere  are,  as  we  have  said,  situations  in  which  a voluntary'  action 
would  not  be  thought  of  as  an  intervention  precluding  causal  con- 
nexion in  this  way.  These  are  the  cases  discussed  further  below  where 
an  opportunity  commonly  exploited  for  harmful  actions  is  negligently 
provided,  or  one  person  intentionally  provides  another  with  a certain 
type  of  reason  for  wrongdoing.  Except  in  such  cases  a volunary  inter- 
vention is  a limit  past  which  consequences  are  not  traced.  By  contrast, 
actions  which  in  any  of  a variety  of  different  ways  are  less  than  fully 
voluntary  are  assimilated  to  the  means  by  which  or  the  circumstances 
in  which  the  earlier  action  brings  about  the  consequences.  -Such 
actions  are  not  the  outcome  of  an  informed  choice  made  without 
pressure  from  others,  and  the  different  ways  in  which  human  action 
may  fall  short  in  this  respect  range  from  defective  muscular  control, 
through  lack  of  consciousness  or  knowledge,  to  the  vaguer  notions 
of  duress  and  of  predicaments,  created  by  the  first  agent  for  the 
second,  in  which  there  is  no  ‘fair’  choice. 

In  considering  examples  of  such  actions  and  their  bearing  on  causal 
judgments  there  are  three  dangers  to  avoid.  It  would  be  folly  to  think 
that  in  tracing  connexions  through  such  actions  instead  of  regarding 
them,  like  voluntary  interventions,  as  a limit,  ordinary  thought  has 
clearly  separated  out  their  non-voluntary  aspect  from  others  by 
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which  they  are  often  accompanied.  Thus  even  in  the  crude  case  where 
A lets  offa  gun  (intentionally  or  not)  and  startles  B,  so  that  he  makes  an 
involuntary  movement  of  his  arm  which  breaks  a class,  the  common- 
ness of  si.ch  a reaction  as  much  as  its  compulsive  character  may 
influence  the  judgment  that  A's  action  was  the  cause  of  the  damage. 

Secondly  we  must  not  impute  to  ordinary  thought  all  the  fine 
discriminations  that  could  be  made  and  in  tact  are  to  be  found  in  a 
legal  system,  or  an  equal  willingness  to  vupply  answers  to  complex 
questions  in  causal  terms.  Where  there  is  no  precisesystam  of  punish- 
ment, compensation  or  reward  to  administer,  ordinary  men  will  not 
often  have  faced  such  questions  as  whether  the  injuries  suffered  by  a 
motorist  who  collides  with  another  in  swerving  to  avoid  a child  are 
consequences  attributable  to  the  neglect  of  the  child's  parents  in 
allowing  it  to  wander  on  to  the  road.  Such  questions  courts  have  to 
answer  and  in  such  cases  common  judgments  provide  only  a general, 
though  still  an  important  indication  of  what  are  the  relevant  factors. 

Thirdly,  though  very  frequently  non-voluntary  actions  arc  assimi- 
lated to  mere  conditions  or  means  by  which  the  first  agent  brings 
about  the  consequences,  the  assimilation  is  never  quite  complete. 
This  is  manifested  by  the  general  avoidance  of  many  causal  locu- 
tions which  are  appropriate  when  the  consequences  are  traced  (as 
in  the  first  case)  through  purely  physical  events.  Thus  even  in  the 
case  in  which  the  second  agent’s  role  is  hardly  an  ‘action’  at  all,  e.g. 
where  A hits  B,  who  staggers  against  a glass  window  and  breaks  it,  we 
should  say  that  A’s  blow  made  B stagger  and  break  the  glass,  rather 
than  that  A’s  blow  caused  the  glass  to  break,  though  in  any  explana- 
tory or  attributive  context  the  case  would  be  summarized  by  saying 
that  A’s  action  was  the  cause  of  the  damage  or  that  A had  caused  it. 

In  the  last  two  cases  where  B’s  movements  are  involuntary  in 
the  sense  that  they  are  not  part  of  any  action  which  he  chose  or 
intended  to  do,  their  connexion  with  A’s  action  would  be  described 
by  saying  that  A's  blow  made  B stagger  or  caused  him  to  stagger  or 
that  the  noise  of  A’s  shot  made  him  jump  or  caused  him  to  jump. 
This  would  be  true,  whether  A intended  or  expected  B to  react  in  this 
way  or  not,  and  the  naturalness  of  treating  A’s  action  as  the  cause  of 
the  ultimate  damage  is  due  to  the  causal  character  of  this  part  of  the 
process  involving  B*s  action.  The  same  is  however  true  where  B’s 
actions  are  not  involuntary  movements  but  A is  considered  to  have 
made  or  caused  B to  do  them  by  less  crude  means.  This  is  the  case  if, 
for  example,  A uses  threats  or  exploits  his  authority  over  B to  make 
B do  something,  e.g.  knock  down  a door.  At  least  where  A’s  threats 
are  ofsericus  harm,  or  B’s  act  was  unquestionably  within  A’s  authority 
to  order,  he  too  has  made  or  forced  or  (in  format  quasidegal  parlance) 
‘caused’  B to  act. 
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Outside  the  area  of  such  cases,  where  B’s  will  would  he  cither  said 
not  to  be  involved  at  all,  or  to  be  overborne  by  A,  arc  cases  w here  A’s 
act  creates  a predicament  for  B narrowing  the  area  of  choice  so  that 
he  has  either  to  inflict  some  harm  on  himself  or  others,  or  sacrifice 
some  important  interest  or  duty.  Such  cases  resemble  coercion  in 
that  A narrows  the  area  of  B’s  choice  but  differ  from  it  in  that  this 
predicament  need  not  be  intentionally  created.  A sets  a house  on  fire 
(intentionally  or  unintentionally):  B to  save  himself  has  to  jump  from 
£ height  involving  certain  injury,  or  to  save  a child  rushes  in  and  is 
scriourly  bunud  Hero  of  course  B’s  movements  arc  not  involuntary; 
the  ’necessity’  of  his  action  is  here  of  a different  order.  His  action  is 
the  outcome  of  a choice  between  two  evils  forced  on  uir^  by  A’s 
action.  In  such  cases,  when  B’s  injuries  are  thought  of  as  the  con- 
sequence  of  the  fire,  the  implicit  judgment  is  made  that  his  action  was 
the  lesser  of  two  eviu  and  in  this  sense  a ‘reasonable’  one  which  he 
was  obliged  to  make  to  avoid  the  greater  evil  This  is  often  paradoxic- 
ally, though  understandably,  described  by  saying  that  here  the  agent 
‘had  no  choice'  but  to  do  what  he  did.  Such  judgment  involve  a 
comparison  of  the  importance  of  the  respective  interests  sacrificed 
and  preserved,  and  the  final  assertion  that  A’s  action  was  the  cause 
of  the  injuries  rests  on  evaluations  about  which  men  rnay  differ. 

Finally,  the  ground  for  treating  some  harm  which  would  not  have 
occurred  v/ithout  B’s  action  as  the  consequence  of  A*$  action  may 
bo  that  B acted  in  ignorance  of  or  under  a mistake  ns  to  some 
feature  of  the  situation  created  by  A.  Poisoning  offers  perhaps  the 
simplest  example  of  the  bearing  on  causal  judgments  of  actions  which 
are  less  than  voluntary  in  this  Aristotelian  sense.  If  A intending  B's 
death  deliberately  poisons  B's  food  and  B,  knowing  this,  deliberately 
takes  the  poison  and  dies,  A has  not  caused  B’s  death:  if  however  B 
does  not  know  the  food  to  be  poisoned,  eats  it  and  dies  A has  caused 
his  death,  even  if  he  put  the  poison  in  unwittingly.  Of  course  only  the 
roughestjudgmentsare  passed  incausa!  terms  in  such  cases  outsidelaw 
courts,  where  fine  degrees  of ‘appreciation’  or  reckless  shutting  of  the 
eyes,  may  have  to  be  discriminated  from ‘full  knowledge’-  Yet,  rough 
as  these  are,  they  indicate  clearly  enough  the  controlling  principles. 

Though  in  the  foregoing  cases  A’s  initiating  action  might  often  be 
described  as  ‘the  cause’  of  the  ultimate  harm,  this  linguistic  fact  is  of 
subordinate  importance  to  the  fact  that,  for  whatever  purpose,  ex- 
planatory, descriptive  or  evaluative,  consequences  of  an  ncticn  are 
traced,  discriminations  are  made  (except  in  die  cases  discussed  later) 
between  free  voluntary'  interventions  and  less  than  voluntary  reactions 
to  the  first  action  or  the  circumstances  created  by  it. 

(iii)  The  analogy  with  single  simple  actions  which  guides  the 
tracing  of  consequences  may  be  broken  by  certain  kinds  of  conjunc- 
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tions  of  physical  events. . hits  B who  falls  to  the  ground  stunned  and 
bruised  by  the  blow;  at  that  momeni  a tree  crashes  to  the  ground  and 
kills  B.  A has  certainly  caused  B’s  bruises  bur  not  his  death:  for 
though  the  fall  of  the  tree  was,  like  the  evening  breeze,  in  our  earlier 
example,  independent  of  and  subsequent  to  the  initiating  action,  it 
would  be  differentiated  from  the  breeze  in  any  description  in  causa! 
terms  of  the  connexion  of  B’s  death  with  A’s  action.  It  is  to  be 
noticed  that  tills  is  not  a matter  which  turns  on  the  intention  with 
which  A struck  B.  Even  if  A hit  B inadvertently  or  accidentally  his 
blow  would  still  be  the  cause  of  B’s  bruises:  he  would  have  caused 
them  though  unintentionally.  Conversely  even  if  A had  intended  his 
Wow  to  kill,  this  would  have  been  an  attempt  to  kill  but  still  not  the 
cause  of  B’s  death.  On  this  legal  and  ordinary  judgments  would  be 
found  to  agree;  and  most  legal  systems  would  distinguish  for  the 
purposes  of  punishment1  an  attempt  with  a fatal  upshot,  issuing  by 
such  chance  oranomalousevents,  from  ’causing  death* — the  terms  in 
which  the  offences  of  murder  and  manslaughter  are  usually  defined. 

Similarly  the  causal  description  of  the  case  does  not  turn  on  the 
moral  appraisal  of  A’s  action  or  the  wish  to  punish  it.  A may  be  a 
robber  and  a murderer  and  B a saint  guarding  the  place  A hoped  to 
plunder.  Or  B may  be  a murderer  ar.d  A a hero  who  has  forced  his 
way  into  B’s  retreat.  In  both  cases  the  causal  judgment  is  the  same. 
A had  caused  the  minor  injuries  but  not  B’s  death,  though  ho  tried  to 
kill  hi;  a.  A may  indeed  be  praised  or  blamed  but  not  for  causing 
B’s  death.  However  intimate  the  connexion  between  causation  and 
responsibility,  it  does  not  determine  causal  judgments  in  this  simple 
way.  Nor  does  the  causal  judgment  turn  on  a refusal  to  attribute 
grave  consequences  to  actions  which  normally  have  less  serious 
results'.  Had  A’s  blow  killed  B outright  and  the  tree,  falling  on  his 
body,  merely  smashed  his  watch  we  should  still  treat  the  coincidental 
character  of  the  fall  of  the  tree  as  determining  the  form  of  causal 
statement.  We  should  then  recognize  A’s  blow  as  the  cause  of  B’s 
death  but  not  of  the  breaking  of  the  watch. 

The  connexion  between  A’s  action  and  B\>  death  in  the  first  case 
would  naturally  be  described  in  the  language  of  coincidence.  ‘It  was 
a coincidence:  it  just  happened  that,  at  the  very  moment  when  A 
knocked  B down,  a tree  crashed  at  the  very  place  where  he  fell  and 
killed  him.’  The  common  legal  metaphor  would  describe  the  fall  of 
the  tree  as  an  ’extraneous’  cause.  This,  however,  is  dangerously  mis- 
leading, as  an  analysis  of  the  notion  of  coincidence  will  show.  It 
suggests  merely  an  event  which  is  subsequent  to  ard  independent  of 
some  other  contingency,  and  of  course  the  fall  of  the  tree  has  both 
these  features  in  relation  to  A’s  blow.  Yet  in  these  respects  the  fall  of 
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the  tree  does  not  differ  front  the  everting  breeze  in  the  earlier  ca^e 
where  we  found  no  difficulty  in  tuncing  causal  connexion.1  The  full 
elucidation  of  the  notion  of  a coincidence  is  a complex  matter  for, 
though  it  is  very  important  as  a limit  in  tracing  consequences,  causal 
questions  are  not  the  only  ones  to  which  the  notion  is  relevant.  The  fol- 
lowing are  its  most  general  characterises.  We  speak  of  a coincidence 
whenever  (I)  the  conjunction  of  two  or  more  ev  ents  tn  certain  spatial 
or  temporal  relations  is  very  unlikely  by  ordinary  standards  and  (2) 
is  for  some  reason  significant  or  important,  provided  (3)  that  they 
occur  without  human  contrivance  and  (d)  are  independent  of  each 
other.  It  is  therefore  a coincidence  if  two  persons  known  to  each  other 
in  London  meet  without  design  in  Paris  on  their  way  to  separate 
independently  chosen  destinations;  or  if  two  persons  living  ir.  differ- 
ent places  independently  decide  to  write  a book  on  the  same  subject. 
The  first  is  a coincidence  of  time  and  place  ( It  just  happened  that 
we  were  at  the  same  place  at  the  same  time'),  and  the  second  a 
coincidence  of  time  only  (’It  just  happened  that  they  both  decided  to 
write  on  the  subject  at  the  same  time*). 

Use  of  this  general  notion  is  made  in  the  special  case  w hen  the  con- 
junction of  two  o:  more  events  occurs  in  temporal  and  or  spatial 
relationships  which  ere  significant,  because,  as  cur  general  knowledge 
of  causal  processes  shows,  this  conjunction  is  required  for  the  pro- 
duction of  some  given  further  event.  In  the  language  of  Mill's  idealized 
model,  they  form  a necessary  part  of  a complex  set  of  jointly  sufficient 
conditions.  In  the  present  case  *he  fall  of  the  tree  just  as  B was  struck 
down  w ithin  its  range  satisfies  the  four  criteria  for  a coincidence  which 
we  have  enumerated.  First,  though  neither  event  was  of  a very  rare 
or  exceptional  kind,  their  conjunction  would  be  rateJ  scry  unlikely 
judged  by  the  standards  of  ordinary' experience.  Secondly,  this  con- 
junction was  causally  significant  for  it  was  a necessary  part  of  the 
process  terminating  in  B’s  death  Thirdly,  this  conjunction  was  not 
consciously  designed  by  A;  had  lie  known  of  the  impending  fall  of  the 
tree  and  hit  B with  the  intention  that  he  should  fall  within  its  range 
B*$  death  would  not  have  been  the  result  of  any  coincidence.  A would 
certainly  have  caused  it.  The  common-  :cr  s:  principle  that  a contrived 
conjunction  cannot  be  a coincidence  is  the  element  of  truth  in  the 
legal  maxim  (too  broadly  stated  even  for  legal  purposes)  that  an 
Intended  conscquciicccannot  be  too  ‘remote'.  Fourthly,  each  member 
of  the  conjunction  in  this  case  was  independent  of  the  other;  whereas 
if  B had  fallen  against  the  tree  with  ar>  irr.pect  sufficient  to  bring  it 
down  on  him,  this  sequence  of  physical  events,  though  freakish  in  its 
way,  would  not  be  a coincidence  and  in  most  contexts  of  ordinary 
life,  as  in  the  law,  the  course  cf  events  would  be  summarized  by  saying 
1 A Voe.  r?  **  ct 
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that  in  this  case,  unlike  that  of  the  coincidence,  A*s  net  was  the  cause 
of  B's  death,  since  each  stage  is  the  cfTcct  of  the  preceding  stage. 
Thus,  the  blow  forced  the  victim  against  the  tree,  the  effect  of  this  was 
to  make  the  tree  fall  and  the  fall  of  the  tree  killed  the  victim. 

One  further  criterion  in  addition  to  these  four  must  be  satisfied  if 
a conjunction  of  events  is  to  rank  as  a coincidence  and  as  a limit  when 
the  consequences  of  the  action  are  traced.  This  further  criterion 
again  shows  the  strength  of  the  influence  which  the  analogy  with  the 
case  of  the  simple  manipulation  of  things  exerts  over  thought  in 
causal  terms.  An  abnormal  condition  existing  at  the  time  of  a human 
intervention  is  distinguished  both  by  ordinary  thought  and,  with  a 
striking  consistency,  by  most  legal  systems1  from  an  abnormal  event 
or  conjunction  of  events  subsequent  to  that  intervention;  the  former, 
unlike  the  latter,  are  not  ranked  as  coincidences  or  'extraneous* 
causes  when  the  consequence;  of  the  intervention  come  to  be  traced. 
Thus  A innocently  gives  B a tap  over  the  head  of  a normalty  quite 
harmless  character,  but  because  B is  then  suffering  from  some  rare 
disease  the  tap  has,  as  we  say,  ‘fatal  results*.  In  this  case  A has  caused 
B*s  death  though  unintentionally.  The  scope  of  the  principle  which 
thus  distinguishes  contemporaneous  abnormal  conditions  from  sub- 
sequent events  is  unclear;  but  at  least  where  a human  being  initiates 
Some  physical  change  in  a thing,  animal,  or  person,  abnormal  physical 
states  of  the  object  affected,  existing  at  the  time,  are  ranked  as  part  of 
the  circumstances  in  which  the  cause 'operates*.  In  the  familiar  con* 
trolling  imagery  these  are  part  of  ‘the  stage  already  set*  before  the 
•intervention*.8 

Judgments  about  coincidences,  though  we  often  agree  in  making 
them,  depend  on  two  related  ways  on  issues  incapable  of  precise 
formulation.  One  of  these  is  patent , the  other  latent  but  equally 
important.  Just  how  unlikely  must  a conjunction  be  to  rank  as  a 
coincidcr.ce,  and  in  the  light  of  what  knowledge  is  likelihood  to  be 
assessed?  The  only  answer  is:  ‘very  unlikely  in  the  light  of  the 
knowledge  available  to  ordinary  men/  It  is  of  course  the  5 nde ter- 
minacies  of  such  standards,  implicit  in  causal  judgments,  that  make 
them  invctcrately  disputable,  and  call  for  the  exercise  of  discretion  or 
choice  by  courts.  The  second  and  latent  indeterminacy  of  these  judg- 
ments depends  on  the  fact  that  the  things  or  events  which  they  relate 
do  not  have  pinned  to  them  some  uniquely  correct  descriptionalways 
to  be  used  in  assessing  likelihood.  It  is  an  important  pervasive  feature 
of  all  our  empirical  judgments  that  there  is  a constant  possibility  of 
more  or  tess  specific  description  of  ary  event  or  thing  with  w hich  they 
ire  concerned.  The  tree  might  be  described  not  simply  as  ‘a  tree*  but 
is  a ‘rotten  tree*  or  as  a ‘fir  tree*  or  a ‘tree  sixty  feet  tall*.  So  too  its 
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fall  might  be  described  not  as  a ‘fall’  but  as  a fall  of  a specified 
distance  at  a specified  velocity.  The  likelihood  of  conjunctions  framed 
in  these  different  terms  would  be  differently  assessed.  The  criteria  of 
appropriate  description  like  the  standard  of  likelihood  are  supplied 
by  consideration  of  common  knowledge.  Even  if  the  scientist  knew 
the  tree  to  be  rotten  and  could  have  predicted  its  fall  with  accuracy, 
this  would  not  change  the  judgment  that  its  fall  at  the  time  when  B 
was  struck  down  within  its  range  was  a coincide  nee ; nor  would  it  make 
the  description  ‘rotten  tree*  appropriate  for  the  assessment  of  the 
chances  involved  in  this  judgment.  There  are  other  controls  over  the 
choice  of  description  derived  from  the  degree  of  specificity  of  our 
interests  in  the  final  outcome  of  the  causal  process.  We  are  concerned 
with  the  fall  of  an  object  sufficient  to  cause  'death*  by  impact  and  the 
precise  force  or  direction  which  may  account  for  the  detail  of  the 
wounds  is  irrelevant  here. 

Opportunities  end  Reasons 

Opportunities.  The  discrimination  of  votuiuiry  interventions  as  a 
limit  is  no  longer  made  when  the  case,  owing  to  the  commonness  or 
appreciable  risk  of  such  harmful  intervention,  can  be  brought  within 
the  scope  of  the  notion  of  providing  an  opportunity,  known  to  be 
commonly  exploited  for  doing  harm.  Here  the  limiting  principles  are 
different.  When  A leaves  tl  ie  house  unlocked  the  range  of  consequences 
to  be  attributed  to  this  neglect,  as  in  any  other  case  where  precautions 
areoniittcd,  depends  primarily  on  the  way  in  which  such  opportunities 
are  commonly  exploited.  An  alternative  formulation  of  this  idea  is 
that  a subsequent  intervention  would  fall  within  the  scope  of  con- 
sequences if  the  likelihood  of  its  occurring  is  one  of  the  reasons  for 
holding  A’s  omission  to  be  negligent. 

It  is  on  these  lines  that  we  would  distinguish  between  the  entry  of 
a thief  and  of  a murderer;  the  opportunity  provided  is  believed  to  be 
sufficiently  commonly  exploited  by  thieves  to  make  it  usual  and  often 
morally  or  legally  obligatory  not  to  provide  it.  Here,  in  attributing 
consequences  to  prior  actions,  causal  judgments  are  directly  con- 
trolled by  the  notion  of  the  risk  created  by  them.  Neglect  of  such  pre- 
cautions is  both  unusual  and  reprehensible.  For  these  reasons  it  would 
t«  hard  to  separate  the  two  ways  in  which  such  neglect  deviates  from 
the  • norm*.  Despite  this,  no  simple  identification  can  be  made  of  the 
notion  of  responsibility  w ith  the  causal  connexion  which  is  a ground 
for  it.  This  is  so  because  the  provision  of  an  opportunity  commonly 
taken  by  others  is  ranked  as  the  cause  of  the  outcome  independently 
©f  the  wish  to  praise  or  blame.  The  causal  judgment  may  be  made 
simply  to  assess  a contribution  to  some  outcome.  Thus,  whether  wc 
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think  well  or  it!  of  the  use  made  of  railways,  we  would  still  claim  that 
the  greater  mobility  of  the  population  in  the  nineteenth  century  was  a 
consequence  :f  their  introduction. 

It  is  obvious  that  the  question  whether  any  given  intervention  is  a 
sufficiently  common  exploitation  of  the  opportunity  provided  to  come 
within  the  risk  is  again  a matter  on  which  judgments  may  differ  though 
they  often  agree.  The  courts,  end  perhaps  ordinary  thought  also, 
often  describe  those  that  are  sufficiently  common  as  'natural*  co\- 
sequences  of  the  neglect.  They  have  in  these  terms  discriminated  *he 
entry  of  a thief  from  the  entry  of  a man  who  burnt  the  house  down, 
and  refused  to  treat  the  destruction  of  the  house  as  a ‘natural’  con- 
sequence of  the  neglect.1 

We  discuss  later  in  Chapter  IX  the  argument  that  this  easilv  intel- 
ligible concept  of  'harm  within  the  risk’,  over-riding  as  it  d)es  the 
distinctions  between  voluntary  interventions  and  others,  should  be 
used  as  the  general  test  for  determining  what  subsequent  harm  should 
be  attributed  for  legal  purposes  to  prior  action.  The  merits  of  this 
proposal  to  refashion  the  law  along  these  simple  lines  are  p:rhaps 
considerable,  yet  consequences  of  actions  are  in  fact  often  traced  both 
in  the  law  and  apart  from  it  in  other  ways  which  depend  on  the  dis- 
crimination of  voluntary  interventions  from  others.  Wc  distinguish, 
after  all,  as  differing  though  related  grounds  of  responsibility,  causing 
harm  by  one’s  own  action  and  providing  opportunities  for  other?  to 
do  harm,  where  the  guiding  analogy  with  the  simple  manipulation  of 
things,  which  underlies  causal  thought,  is  less  close.  When,  as  in  the 
examples  discussed  above,  we  trace  consequences  through  the  non- 
voluntary intervene  ms  of  others  our  concern  is  to  show  that  certain 
stages  of  the  procey,  nave  a certain  type  of  connexion  with  the  preced- 
ing stages,  and  not,  as  when  the  notion  of  risk  is  applied,  to  show  that 
the  ultimate  outcome  is  connected  in  some  general  way  with  the 
initiating  action.  Thus,  when  A’s  shot  nukes  B start  and  break  a glass 
it  is  the  causal  relationship  described  by  the  expression  'made  B 
start*  that  we  have  in  mind  and  not  the  likelihood  that  on  hearing  a 
shot  someone  rmy  break  a glass.  Causal  connexion  may  be  traced  in 
such  cases  though  the  initiating  action  and  the  final  outcome  are  not 
contingencies  that  commonly  go  together. 

Apart  from  threconceptual  reasons  for  distinguishing  these  related 
grounds  for  responsibility,  it  is  clear  that  both  in  tl  e law,  as  we  show 
in  Chapter  VI,  and  apart  from  it  we  constantly  treat  harm  as  caused 
by  a person’s  action  though  it  does  r.o:  fall' within  the  risk’.  If,  when 
B broke  the  gla^s  in  the  example  gi\en  above,  a splinter  flew  into  C’$ 
eye,  blinding  him,  A’s  action  is  indeed  the  cause  of  C’s  injury  though 
we  may  not  always  blame  him  for  so  unusual  a consequence. 

1 y.  Were*  Her  Steregi  Ce . (19J7),  P7  Mm.  I SI,  ? N.E.  2d  511. 
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Reasons.  In  certain  varieties  of  interpersonal  transactions,  unlike 
the  case  of  coercion,  the  second  action  is  quite  voluntary.  A may  not 
threaten  B but  may  bribe  or  advise  or  persuade  him  to  do  something. 
Here,  A does  not  ’cause'  or  'make'  B do  anything:  the  strongest 
wards  we  should  use  are  perhaps  that  he  ‘induced’  or ‘procured’  B's 
r.ct.  Yet  the  law  and  moral  principles  alike  may  treat  one  person  as 
responsible  for  the  harm  which  another  free  agent  has  done  ‘in  con- 
sequence* of  the  advice  or  the  inducements  which  the  first  has 
offered.  In  such  cases  the  limits  concern  the  tange  of  those  actions 
done  by  B which  are  to  rank  as  the  consequence  of  A’s  words  or 
deeds.  In  general  this  question  depends  on  A’s  intentions  or  on  the 
'plan  of  action’  he  puls  before  B.  If  A advises  or  bribes  B to  break  in 
and  steal  from  an  empty  nouse  and  B does  so,  he  acts  in  consequence 
of  A’s  advice  or  bribe.  If  he  deliberately  burns  down  the  house  this 
would  not  be  treated  as  the  consequence  of  A’s  bribe  or  advice,  legally 
or  otherwise,  though  it  may  in  some  sense  be  true  that  the  burning 
would  not  have  taken  place  w ithout  the  advice  or  bribe.  Nice  questions 
may  arise,  which  the  courts  have  to  settle,  where  B diverges  from  the 
detail  of  the  plan  of  action  put  before  him  by  A. 
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Torts  (Boston*  Little,  Brown  and  Company, 
1956),  pp  1108-1121* 


§20.1.  Introductory.  Negligence  is  not  a ground  of  liability 
unless  it  causes  injuiy  or  damage  to  some  interest  which  the  law 
recognizes  and  protects.  Moreover,  it  docs  not  make  a defendant 
liable  for  any  injury  or  damage  that  is  not  a comcquencc  of  the 
negligence*  The  establishment  of  the  requisite  causa!  connection 
is  thercfoic  an  element  of  a plaintiffs  cause  of  action  for  negli- 
gence, to  be  pleaded  and  proven  by  !i :in.  And  where  a came  of 
action  exists,  the  question  of  causal  connection  will  determine  the 
scope  of  liability  — the  extent  of  the  injury  or  damage  for  which 
defendant  will  have  to  pav.  The  question  then  naturally  arises 
what  is  the  kind  of  causal  connection  or  relationship  that  the  law 
rcouircs  to  be  pro\c;i.  Ob\  ioas!y  the  legal  test  includes  a require- 
ment that  the  wrongful  conduct  must  be  a cause  in  fact  of  the 
harm;  but  if  this  stood  alone  the  scope  of  liability  would  be  vast 
indeed,  lor vjthc  causes  of  causes  farc]  infinite”  V-  "the  fa tal* tres- 
pass done  by  Eve  was  cause  of  all  our  woe-”  Dut  the  law  has  not 
stopped  there  — it  ha*  developed  fvnlhct  testtittiom  and  limita- 
tions. The  concept  tint  development  lias  produced  is  generally 
called  “proximate''  or  “legal”  cause.  To  be  sure  this  concept  is 
only  one  of  the  devices  used  to  limit  the  fact  and  the  extent  of 
liability  for  negligence.1  In  the  progress  of  negligence  law,  how- 
ever, the  concept  of  ptoxiinate  cause  has  been  greatly  overworked 

$20.1,  1 LitAirattons  on  the  re^pc  of  duly  irJ  tfc t n;!e  of  corUibiiiory 
arc.  for  e\air.p!o  cUnr  such  do  kef 
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to  limit  or  control  both  the  liability  of  defendant  and  the  effect 
of  contributory  negligence  because  of  many  considerations  which 
can  be  treated  in  a more  meaning  ul  and  significant  way  in  con- 
nection with  other  issues,  such  as  that  of  duty,  standard  of  conduct 
and  the  like,2  “Having  no  imeginted  meaning  of  its  own,  [the] 
chameleon  quality  [of  proximate  cause]  permits  it  to  be  substi- 
tuted for  any  one  of  the  elements  of  a negligence  case  when 
decision  on  that  element  becomes  dillicult.  . . . No  other  formula 
• . . so  nearly  does  the  work  of  Aladdin's  lamp."3  The  result 
has  been  a widely  recognized  confusion,  and  as  luxuriant  a crop 
of  legal  literature  as  is  to  be  hud  in  any  branch  of  tort  lav/,4  Per- 
haps recent  years  have  seen  a little  headway  made  in  dispelling 
the  confusion  and  taking  some  of  cite  work  load  off  of  mis  weary 
concept  by  separating  other  strands  for  analysis  and  treatment 
under  other  more  appropriate  headings.  If  this  is  so,  much  of 
the  credit  belongs  to  Dean  Green. 

2 See  Green,  Rationale  of  Proximate  C^usc  75,  122  ct  icq.  (1927);  Carpenter, 
Workable  Rules  for  Determining  rroxirn.uc  C u;«c,  2D  Calif.  L.  Rev.  £29,  i!46-?53 
(1552);  ard  Malihle  C J.,  in  kindcr.xsitch  rainier,  127  Conn.  S3,  S9  ct  scq.,  IS 
A.2d  $3,  (1910). 

^ Green,  Proximate  Cause  in  Texas  Negligence  Lv.v,  £$  Texas  l..  Rev,  471,  471- 
472  (1930). 

*TLc  following  is  a suggestive  but  not  exhaustive  list  of  literature:  Green, 
Rationale  of  Pinxhnaic  Cause  (1927);  Bohlcrt,  The  Probable  or  the  Natural  Conse- 
quences as  th.c  Test  of  Liability  in  Negligence,  49  Am.  L Reg.  79,  MS  (1901); 
Bingham,  Some  Suggestions  Concerning  ‘Legal  Caine**  at  Common  Law,  9 Coluin. 
L.  Rev.  16,  1 36  (1909);  Smith,  Legal  Cause  in  Actions  of  Toil,  25  Harv.  L.  Rev. 
103,  223,  303  (1911);  Terry,  Proximate  Consequences  in  the  law  of  Yotts,  2S  Hat v. 
L.  Rev.  10  (191 4);  Beale,  The  Proximate  Consequences  cf  an  Act,  S3  Harv.  L. 
Rcm.  633  (1920);  Pollock,  Ltabilii)  for  Consequences,  35  L.Q.  Pvcv.  165  (1922):  Levitt, 
Cause,  Leg.U  Cause  and  Proximate  Cause,  21  Mich.  L Rev.  54,  *.69  (1922);  Reese, 
Negligence  and  Proximate  Cause,  7 Cornell  1 Q.  95  (1922);  Creen,  Are  Negligence 
and  "Proximate*’  Cause  Determined  by  the  $-.:«c  Test*  1 Texas  L.  Rev.  242,  23$ 
(1925);  Fdgciton,  Lrg.il  Cause,  72  U.  Pa.  1 Rev.  211,  3iJ  (1924);  McLaughlin, 
Proximate  Cause,  39  Itarv.  L.  Rev.  U9  (1925);  Burl  Rules  of  Legal  Cause  in 
Negligence  Ov:*,  15  Calif  L.  Rev.  1 (1926);  Green,  Are  There  Dependable  Rule* 
of  Causation,  77  U.  Ta.  L.  Rev.  C01  (1929);  Cai[  enter,  Wo  Valle  Rules  for  De- 
termining Proximate  Cause,  20  Calif.  L.  Rev.  229,  393.  471  (1932);  Tcaslec,  Multiple 
Causation  an  ! Damage,  47  Uaiv.  L.  Re*.  1127  (I Bat);  Carpenter,  Concurrent  Causa- 
tion, 13  U.  Pa.  L.  Rev.  911  (1935);  Ptosscr.  The  Minnesota  Ooutt  cn  Proximate 
Cause,  21  Minn.  L Rev.  19  (1956).  Campbell,  Duty  Fault  and  Legal  Cause,  (193$) 
WIs.  L Rev.  402;  Gregory,  Proximate  Cau<e  in  Nrglsgciscc  — A Retreat  from 
RaiioftMitafio.i,  $ U.  Chi.  I..  Rev  35  Morris.  On  the  Teaching  of  Legal 

Cause,  39  Coturu.  L.  Rev.  1057  (1959);  °xtpenur,  Proximate  Cause,  14  So.  Calif. 
L Rev.  I,  115  (1910).  15  So.  Call!.  I Rev.  H7,  301,  427  (I9ll),  16  So.  Calif.  L. 
Rev.  1,  61,  275  (1912);  Monis,  F roximate  Cau^c  in  Minntota,  31  Minn.  L.  Rev. 
IS5  (195C).  Gicen.  Pioxi-r.a'c  Ca  i*e  in  Texts  Negligence  L it-*,  23  Texas  L.  Rev. 
471,  621,  735  (1930);  Monis,  Duty,  Negligent  a id  Causation,  101  U.  Pa.  L Ret.  139 
(1932>;  Green,  Proximate  Cause,  17  T exas  V-  j 161  (1954). 

The.ve  writings  represent  nearly  evert  concc liable  pairt  cf  view.  For  reasons 
that  (he  pre'Cnt  authors  fmd  elusive,  Metis*  g^ii  so  far  as  to  disparage  the  re* 
quiienjnt  of  cause  in  fact. 
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§20.2.  Causal  relation  or  cause  in  fact.  Through  all  the  di- 
verse theories  of  proximate  cause  rims  a common  thread;  all  agree 
that  defendant’s  wrongful  conduct  must  be  a cause  in  fact  of 
p^aintilFs  injury  before  thcTc  is  liability.  This  notion  is  not  a 
metaphysical  one  but  an  ordinary,  matter-of-fact  inquiry  into  the 
existence  or  nonexistence  of  a causal  relation  as  laymen  would 
view  it.  Clearly  this  is  not  a quest  for  a sole  cause.  Probably  it 
cannot  be  said  of  any  event  that  it  lias  a swigle  causal  amreedem; 
usually  there  are  many.  ; For  the  purpose  of  the  prc<ent  inquiry 
it  is  enough  that  defendant’s  negligence  be  a came  in  fact  of  the 
harm.  N 

A rough  working  test  of  this  relation,  \alid  for  most  cases  ami 
enjoying  wide  currency,  is  the  “but  for"  or  sine  qua  non  test: 
defendant's  negligence  is  a cause  in  fact  of  an  iujuiy\whcrc  the 
injury  would  not  have  occurred  but  for  defendant's  negligent  con- 
duct.1 It  is  probably  safe  to  say  that  wherever  this  test  is  met,  the 
cause  in  fact  relation2  docs  exist.  JQut  the  test  is  not  universally 
valid  as  a test  of  exclusion  of  causes  in  facty  There  is  one  situation 
where  it  will  not  work,  If  two  independent  causes  concur  to  pio 
ducc  a reside  which  either  of  them  .done  would  have  produced, 
each  is  a cause  in  fact  of  that  result  though  it  would  have  hap- 
pened without  either  cause  (but  not  if  both  causes  were  absent)  * 
Thus  if  two  fires  from  separate  sources  combine  as  the)  reach 
plaintiff’s  property,  and  consume  it,  each  is  a cause  though  it  be 
assumed  or  shown  that  the  property  would  have  been  consumed 
as  completely  by  either  fire  alonej 

So  far  as  the  jitlulanfit/c  law  of  torts  goes,  the  cause  in  fact 
aspect  of  the  requirement  of  legal  cause  gives  little  trouble,  and 
it  lias  been  called  a simple  one.4  Vet  (here  are  problems  here 

§20  2.  1 Carpenter,  WoiVablc  RuUs  for  Determining  Proximate  Cau*e,  20  Calif. 
L.  Rev.  229,  255-25$,  596-597  <1992);  Smith,  L'gal  Came  in  Artiom  of  Tori.  25 
H atv.  L Rev.  105  ICS-1 10  (1911). 

• But  not  nectionljr  the  relationship  of  legit  or  ptoximate  tauw.  It  has  been 
saM  that  the  "but  (01”  tot  is  \ at i J a*  a lot  of  t\tUi*;cn  (but  not  of  inci  sion)  for 
proxvnite  cause  (except  in  the  coic  el  the  concuning  efficient  caii<c*  gnen  in  the 
text).  Smith,  Legal  Cau«e  in  Action*  of  Tort,  25  Ihiv.  L Rtv.  105,  ICO  (121 1) 
A moment's  icfkciioft  will  meal  that  this  siattiner.l  ami  ti  e one  in  the  text  i'-hicS 
Is  ccnce.ncd  Kith  tame  is  feel  only)  arc  r.ol  at  all  incownMcnl.  Oi  pen  ter,  WorV 
able  Rules  for  Determining  Troxiinate  Cause,  20  Calif.  L.  Rev.  229,  526  (1952). 
remarks,  obliquely,  that  the  but  for"  rule  n ay  be  mej  at  a test  for  nrefurion  with 
report  to  proximal e censes  mJ  of  inctuthn  of  cauws  in  fact. 

• See  rrosser.  Tour  220  fid  cd.  1955). 

< Green,  Rationale  of  Proximate  Ciure  156  (1927);  Green,  Proximate  Cause  in 
Tcoi  Negligence  Law,  :$  Texa?  L Rev.  171.  4*4.  -176  <1952).  Of  course,  if 
ararraruably  confuKj  with  other  aspect*  of  a neglige  nee  cue,  the  "tausitiofT  i»-e 
thus  created  may  appear  extremely  complex. 
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which  often  beset  the  trial  court  and  practitioner  and  prove  fatal 
to  many  a case.  jThe  most  serious  of  these  problems  is  that  of 
sufficiency  of  proof  where  the  ascertainable  facts  arc  meager,  or 
where  they  present  complicated  questions  of  science,  medicine, 
engineering,  or  the  like.  Another — -usually  less  serious  — prob- 
lem is  in  finding  appropriate  language  for  the  charge.^ 

The  problem  of  proof  here  is  essentially  like  that  presented  by 
the  problem  of  proving  ncghgcnce  circumstantially,  and  the  two 
often  overlap.  Thus  in  res  i^sa  loquitur  eases  the  requirement  of 
defendant's  exclusive  control  of  the  injuring  agency  really  calls  for 
nothing  tnoic  than  proof  tending  to  eliminate  other  possible 
causes  of  the  occurrence,  so  as  to  indicate  that  the  negligence  of 
which  that  occurrence  speaks  is  probably  that  of  defendant.5  But 
where  negligence  is  pinned  on  the  defendant  by  direct  or  other 
sufficient  proof  there  may  still  remain  a serious  problem  of  prov- 
ing the  causal  relation.  This  is  not  always  so  — perhaps  in  the 
bulk  of  accident  eases  the  real  issues  turn  on  credibility  of  wit- 
nesses and  the  evaluation  of  conduct.*  But  there  is  a substantial 
number  where  causal  relation  re^is  on  inference  from  circunv 
stanral  evidence,  and  in  them  is  found  the  same  test  of  delusive 
exactness  that  is  generally  used  to  gauge  the  legitimacy  of  the  basis 
for  an  inference  front  circumstantial  evidence.  "Whcic,"  the  New 
York  court  has  said,  "the  facts  proven  show  that  there  are  several 
possible  causes  of  an  injury,  for  one  or  more  of  which  the  de- 
fendant was  not  responsible,  and  it  is  just  as  reasonable  that  the 
injury  was  the  result  of  one  cause  or  the  other,  p!aim:fl  cannot 
have  a recovery  since  he  has  failed  to  prove  that  the  negligence 
of  the  defendant  caused  the  injury.”  T But  here,  as  elsewhere, 

S$ce  f19.7  supra ; James.  Proof  of  the  Breach  in  Negligence  Coses,  37  Vt.  L.  P.er. 

1*9,  29  f (t  icq.  (1971).  The  lot  of  legitimacy  of  inference*  from  circumstantial 
evidence  is  discussed  in  §19.4  supra. 

♦ Thus,  vhtre  there  arc  C)C^!tr,e**cs  tc  an  accident,  their  testimony  will  generally 
lufhcc  to  establish  defendant*!  breach  of  duty  anJ  direct  causation  of  plasntifF* 
injuries.  i' 

Mngcrsoll  v.  I.lbaty  Hanl  of  Buffalo.  273  N Y.  1,  7,  14  N.t.Cd  $2$,  $29  (193$). 

See  tWo  Ailing  v.  Northwestern  Be  l Teh  Co..  BO  Minn.  CO.  63.  191  NAV.  313,  314 
{1923)  f The  burden  ii  cn  the  phiintifT  to  show  lb  t it  is  more  probable  that  the 
harm  resulted  in  consequence  of  something  f<-.t  fchicb  the  defendant  *a*  responsible 
than  in  conscqjcuee  of  something  for  vhleh  ho  was  cot**),  and  Rjnbcrg  t.  Morgan, 

209  lo*o  474.  457.  213  NAV.  4 92,  493  (IC23)  f Where  ti  e p:oct  disclose*  that  a 
gisen  result  may  base  occurred  by  reason  rt  more  than  one  proximate  cause,  and 
the  jury  can  do  no  more  than  guc.s  or  speculate  a>  to  *hkh  tvas,  in  feel,  the  effi- 
cient c.hisc,  the  submission  of  such  choree  to  the  jury  has  been  consistently  con- 
demned by  this  court  and  by  other  courts  *;  quoted  with  app’osvl  in  Frye  r.  City 
of  Detroit!  27G  Nheh.  4C«,  470,  239  NAV.  *s;  (1932). 

Cf.  the  more  liber  J statement  cf  the  te«f  found  In  ltir,de  r.  Cudahy  Patling  Co, 

IV)  lowi  6<*.  7ot.  II#  NAV.  1005.  1003  MW**)  and  quoted  Mrh  apptosal  in  BoeV 
v.  Fell  man  Dry  Good*  Co„  212  $AV.  $33,  tf«  (Tex  tir.  App.  1919)  ( A cause  be; t\g 
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this  test  permits  wide  latitude  where  facts  arc  meager  and  where 
it  is  pure  matter  of  guesswork  where  the  greater  probabilities  lie. 

A man  is  found  dead  at  the  foot  of  an  unlightctl  apartment  hall- 
way,8  or  at  the  bottom  oc  an  elevator  well  one  of  the  entrances  to 
which  was  left  unguarded,*  or  at  a railroad  crossing  after  the 
passing  of  a train  that  failed  to  whistle.10  Death  occurred  at  the 
unsafe  place,  or  at  the  hands  of  the  negligently  operated  agency, 
but  did  it  occur  because  of  the  defect  or  the  negligence?  Would 
the  precautions  demanded  by  reasonable  cave  have  averted  the 
accident?  Obviously  no  one  knows — moreover  no  one  even 
knows  what  the  probabilities  arc,  and  the  courts  arc  free  to  choose 
either  a postulate  which  will  let  the  jmy  draw  an  inference  of 
negligence,  or  its  opposite. 

On  the  meager  kind  of  showing  described,  therefore,  it  is  not  . 
surprising  to  find  variant  results.11  Courts  which  take  a strict 
attitude  stress  the  fallacy  of  reasoning  post  hoc  ergo  propter  hoc,15 
and  the  likelihood  of  explanations  involving  no  causal  relation: 
"It  is  a matter  of  common  knowledge  that  people  sometimes  fall 

shown  which  might  ptoducc  .in  acridei.t,  and  it  further  appearing  that  an  accident 
of  that  particular  character  did  occur,  it  is  a tor  Tamable  inference,  in  tin  absence 
of  shoeing  of  other  cause,  that  the  one  known  was  the  op„r?.ti\c  2£cuc>  in  bringing 
about  such  results**). 

Mngcrsoll  v.  l iberty  Dank  of  Buffalo,  27$  N.Y.  I,  14  N fcfM  *7*  (IMS);  Wolf 
v,  Kaufrnann,  227  App,  Div.  281,  237  X.Y.  Sirpp.  530  (1920);  vL  Boinsrcin  v.  Faden, 
149  App.  Div.  37,  133  N.Y.  Snpp.  603  (1912),  *Q'd,  203  N.Y.  <Ak  102  N F.  1099 
(1913);  Reynolds  v.  Teos  fc  P.  Rv,  Co.,  37  Li.  Ann.  C94  (IS$5);  Fairc  w Gamble 
Stores,  202  Minn.  402,  279  NAV.  257  (1958). 

tlfopMnscn  v.  Knapp  8*  Spalding  Co.,  92  Iqkj  $2$,  00  N.W.  653  (1$9l);  Foci: 
v,  Fcllrn-n  Dr)  Goods  Co,  212  SAY.  G53  (Tcv.  Civ,  App.  1019);  MiCowat* Mercer 
Printing  Co.  v.  Taylor,  115  F 2d  (G:h  Cir.  1910). 

loPuckfubcr  v.  So.  \'?c.  R.  Co.  152  CM.  353.  01  Fac.  450  (1201);  Burlington- Rock 
Island  R.  Co.  v.  tllicon,  MO  Tcv  355,  107  SAVCd  723  (1913).  Cf.  Woodworth  v. 
N.Y.  Central  R.  Co.  149  Ohio  St.  515.  50  N K2d  .42  (1945),  soundly  tt  it  id  ted  in  a 
Note,  IS  U.  Cin.  L.  Rev.  371  (1949). 

44  On  nearly  identical  fads  even  the  same  judge  may  eoine  to  contradictory  eon* 
elusions  at  different  times.  Compare  Wolf  v.  Kaufmar.n,  227  App.  Div.  251, 23?  N.Y. 
Supp.  550  (1929)  (denying  rccovciy  for  death  of  plaintiff  s intestate  found  uncon* 
relou*  at  ftot  of  stairway  tnlightid  in  violation  of  statute)  with  fngersoil  v.  Liberty 
Bank  of  Buffalo,  273  N.Y.  I,  14  N F.2d  528  (197?^  (directly  opposed).  Both  opinion! 
were  by  Finch,  J, 

15  See,  eg.,  Springer  v.  Security  Nat.  Dank  etc  Co,  175  S.WJM  797  (1943)  (con- 
nection between  alleged  negligence  of  froktr  and  lews  to  customer  not  established): 
F*)ne  v.  Chandler,  41  Ca.  App.  555,  155  5 F.  90  (1950)  (evidence  that  piainiifl 
suffered  pain  in  heart  and  ©diet  ailments  afut  denriu  caused  her  to  swallow  tin* 
Identified,  unplcasant-tauirg  liquid  is  inm.Tcitnl  t^  authoiirc  inference  of  causal 
relation);  Ktamcr  Service  v.  Wilkins,  154  Mivs  4S3,  156  So.  624  (1939)  (connection 
between  traumatic  injury  to  hand  and  tantet  of  skin  at  point  of  injury  rot  es- 
tablished). 

But  cf.  Reynolds  v,  Texas  k P.  Ry.  Co.,  3?  La.  694  (1SS;>).  In  which  the  court 
recogfd/cd  the  "distinction  between  foil  her  and  f topur  hce/*  but  found  for  the 
plaintiff  anyway  because  "the  k*  vie  tcndcrey  of  the  evidence  connect!  the  accident 
with  the  hcgligcnce.** 
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downstairs  in  broad  daylight*  ;15  the  pedestrian  who  failed  to  hear 
the  train  might  well  not  have  heard  the  whistle,  perhaps  he  fainted 
on  the  tracks,11  and  so  on.*5  Courts  which  take  a sympathetic 
view  point  out  that  defendant's  negligence  greatly  multiplied  the 
chances  of  an  accident  of  just  the  kind  plaintiff  claims  happened, 
that  the  proven  facts  are  entirely  consistent  with  its  having  hap- 
pened that  way,  and  that  the  evidence  does  not  strongly  suggest 
an  alternative  explanation.1* 

In  most  eases  the  facts  are  not  as  meager  as  those  described 
above  and  the  additional  evidence  generally  reduc  :s  the  area  of 
uncertainly  and  enables  the  Cv_urt  to  make  a much  more  sure- 
footed judgment  on  the  matter  of  probabilities,  though  even  here 
there  is  often  room  for  difference  of  opinion.  On  the  whole, 
courts  have  tended  to  view  with  liberality  the  legitimacy  of  the 
inference  of  causal  relation  in  these  cases/  Thus  where  it  appears'7 
hcu>  an  accident  happened  ami  also  that  the  victim  rrtig/if  have 
saved  himself  by  taking  advantage  of  a precaution  which  it  has 
been  shown  defendant  negligently  failed  to  afford,  courts  have 
generally  let  a jury  find  the  failure  caused  the  harm,  though  it  is 
often  a pretty  speculative  matter  whether  the  precaution  would 
in  fact  have  saved  the  victim.11  ^ Of  course  if  the  possibility  of 

11  Dissenting  opinion  by  Metaughlin,  J , in  Bornstcln  v.  F.idcn,  149  App.  DIv.  57, 

42,  115  N.Y.  Svipp.  $0$,  611  *(1912).  203  N\Y.  CA\  W2  NT.  10 9)  (1915).  See 

also  Wolf.  v.  Knufmann,  227  App.  DA.  2St,  237  N.Y.  $upp.  3)0  (K29). 

HRueVhaber  v.  So.  Pac.  R.  Co..  152  Cal.  SCO.  Cl  Fac.  4S0  (tCOl). 

»For  example,  the  unsympathetic  rejection  of  inferences  as  to  the  cause  of  the 
• tfplanc  crash  in  vhich  the  plaintiff's  decedent  was  kdlcd  in  Morrison  LeTcur* 
ncau  Co.  of  Ccoigia,  15S  F.2d  53)  (5th  Cir.  1913),  criticircJ  in  Note,  55  IU.  L Rev. 
326(1911). 

l*Scc  Reynolds  v.  T cvai  1-  p.  Ry.  Co.  57  1 Ann.  691.  693  (ISS5)  fVbcre  the 
negligence  of  the  defendant  great!)  multiplies  the  cancel  of  accident  to  the  plain- 
tiff. and  i\  of  a character  naturally  leading  to  its  occurrence,  the  mere  possibility 
♦hat  It  might  ha%c  happened  vithout  the  negligence  is  not  sufficient  to  break  the 
chain  of  cause  and  effect  between  *he  negtigcrce  *nd  the  injur)");  I.uni'e  ?.  Cudahy 
fading  Co.,  139  lova  117  NAV.  1065  (I9CU)  (statement  quoted  in  note  7 lupra); 
Emery  *,  Tilo  RooSng  Co  . *9  N il  165.  195  At?  409  (19371;  Catci  v.  B.  Sc  M R.  Co , 
255  Mass.  207,  la!  NT.  320  (1376);  P.nrlingUMi  Ro  A Hand  R.  Co  t.  Ellison.  140 
lex.  353,  \07  S U .2d  72J  (»9I5>,  Texas  k N O R.  Co.  r.  riving,  45  $.W2M  593  (Tex. 
Civ.  App.  1931);  P.ot  ft  stein  v.  fatten,  119  App.  Div.  37.  t53  N.V.  Supp  603  (1912). 
iQ'S,  2ci?  N-V.  605,  102  NT.  1099  (1913);  CornbrmaVs  v.  Terminal  Barber  Shops. 
Inc.  2*2  N.V.  217,  25  NT.  2d  23  (I<>10);  Note.  45  Mich.  Rev.  535  (1950).  Cf. 
note  S3  in/,*. 

H$ce  2 Restatement  of  Torts  §132,  Comment  r.  In  7inrel  v.  U.  S.  Shipping 
Roaid,  10  f.2d  47,  49  (2d  Cir.  1925),  i)  c i<>»ie  Was  whether  a guard  rope,  the 
iWr.ee  of  vhich  constituted  the  allege  ! ncgli;cHcc  of  the  defendant,  would  base 
presented  the  phiMiff*  intestate  horn  bcinj  Crashed  oveibojrd.  L.  Hand,  J.,  said, 
in  resetsing  a judgment  d;«mif*ing  the  complaint:  NAbo;:t  tint  ve  agree  no  certain 
conclusion  vai  possible.  Nobody  could,  in  the  future  of  ihirg5,  be  suic  that  the 
Intestate  would  hast  seized  the  rope.  cr.  it  1 e had  n>t.  thit  it  vou'd  base  stepped 
his  b*ty.  Ktui  vc  ate  net  . * » justified,  vine  ccxtair.i)  U itrposdble,  in  insbtirg 
it.  We  cannot  sxy  t!  at  there  vas  no  IAcIiLv)!  that  a tope  three  feet  abose 
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salvation  is  too  dcmly  foreclosed  by  the  facts  of  a given  case,  the 
inference  may  not  be  drawn.11  Grade  crossing  collisions  furnish 
familiar  examples.  Where  the  victim’s  automobile  is  driven  onto 
the  crossing  immediately  in  ft  vn  of  an  obvious  and  very  audible 
train  the  jury  still  may  find  that  the  driver’s  ob!i\?ous  attention 
uoutd  have  been  arrested  in  time  if  the  whistle  had  been  blown.1* 

t’"t  deck  . . . would  not  rave  saved  the  seaman.  , . . Consid.rtng  that  such  liner 
were  n>n  for  the  express  purpose.  .imon^  oiHet »,  of  projecting  seamen,  uo  think 
it  a qi'caion  about  which  rcafcnaldc  men  might  at  lean  differ  whether  the  in- 
testate ^outd  not  have  bion  «n*cJ  hr  1 it  been  there.”  Sec  aho  Kiundch  v.  Stand- 
ard Dredging  Co.,  112  K.2d  163  (3d  Cir.  12«0).  wherein  a seaman  who  could  not 
swim  fell  outboard  and  was  lasted  a one-inch  leaving  line  which  lardo  1 two  feel 
from  where  he  was  struggling:  *hc  court  found  failure  to  .icnve  an  available  ring 
buoy  to  be  the  cause  in  Tact  o»  the  diowning.  ".  . . we  think  he  might  (the  ap- 
propriate grammatical  mood)  base  wed  himself  through  the  help  of  something 
he  could  more  easily  grab."  112  F.iM  at  I (■  1.  Cf.  Partis  v.  Pennsylvania  R.  Co., 
50  F.2d  865  (4ih  Cir.  1031)  («crnWc*  but  HatiU  was  a good  swimmer  and  therefore 
there  uis  even  greater  probability  of  his  reaching  a ring  buoy  or  heaving  line, 
both  of  which  were  available,  had  defendant's  servant*  thrown  cnc). 

M NY.  Cent.  R.  Co.  v,  GrinuUd,  2M  Fed.  351  t‘7d  C'ir.  1920),  contrasts  nice!)  with 
the  caws  cited  in  rote  17  suj't/i  and  illustrates  the  point  of  difference  made  in  the 
tevt.  I he  man  overboard  could  not  sw in,  b it  was  «een  b\  his  wife  while  he  was 
still  struggling.  She  lushed  back  into  the  cabin  for  a Hne.  but  when  slic  returned 
hr  had  disappeared.  The  negligence  alleged  was  the  failuto  to  equip  the  barge 
Kith  a life  buoy,  but  the  court  found  any  carnal  connection  between  this  failure 
and  the  drowning  to  be  "pure  conjccuie  and  speculation.*  There  was  nothing 
to  show  that,  if  there  had  loco  a life  buoy  on  Ix-aid,  the  wife  would  have  got  It 
Sooner  hao  she  got  the  jiiiall  line.  To;d  v.  Trident  Fisheries  Co.,  232  Mass.  400, 
122  N E.  $99  (1919)  is  similar  (negligence  with  ifg.isd  to  readiness  of  lifeboat  im- 
material as  man  disappeared  tnuncdhuMv  upon  hilling  overboard,  and  therefore 
could  rot  have  been  saved  even  if  defendant  had  tint  been  negligent). 

tr  will  be  rceognired  that  thc*c  eases  itliiMu-tc  the  converse  of  the  ' but  for"  rule: 
If  the  accident  would  have  happened  withe. til  dcf<  i daut’s  negligent  act,  then  such 
ii  not  the  cause  of  it.  See  Russo  v.  Au<-  in,  7 ho.  2d  741  <ba.  App.  I?12)  (negligence 
of  D‘  fn  pulling  away  from  edge  of  read  without  signaling  held  not  to  be  tame 
of  accident  in  v*hich  D-«  gmng  southward,  as  was  D\  came  sjKCvling  around  Mind 
curve,  saw  Tb  in  his  path,  hut  being  unable  to  stop  by  reason  of  defective  brakes 
gta/ed  tb's  car  ar  i plowed  Into  the  pbrntifFi  toul;.  which  was  going  northward; 
even  if  th  had  signaled,  the  accident  would  hue  happened  tie  same  way);  Flecheriv 
v.  Sullivan,  129  $ W 2d  925  (Mo.  App.  1959)  fiihuc  to  warn  by  founding  hern  net 
proximate  cause  cf  injury  where  p;i  s -n  to  be  wit  ned  and  complaining  of  (he  failure 
was  aware  of  the  impending  dar.geti:  $ucy  v.  Knickerbocker  tee  Co.,  \VK  614, 
M NAV.  1091  (1S93)  (failure  to  Indicate  Kxotlcn  ©f,  or  to  fence  in,  hole  in  ice  rot 
ihe  tivUi  cf  the  drowning  cf  runaway  korfe*  since  the  horses  could  not  have  been 
halted  or  «awd  had  all  these  precaution*  hc-n  taken):  Citv  c-f  Piqua  v.  Morris,  93 
Ohio  St.  42.  120  N F SCO,  7 AI  R.  129  , i 2 i 5 ) ,rwi*  gence  c 4 city  In  constructicn  or 
maintenance  of  spillway  rot  the  cau«.c  el  overCov  cf  pond  to  plaintiffs  damage, 
where  extraordinary  rainfall  would  have  cau«ed  perd  to  ©vetflovf  its  embankment 
In  any  event);  West  Texas  Ctilitic*  Co.  v.  Il  rris  231  5AV.2 d 55$  (Tex  Civ.  App. 
1950)  (failute  to  maintain  high  vo!t:.;r  »i.c  at  »Tisutoxx  height  rot  cause  of  dtee 
dent's  electrocution  whose  pipe  hvrd’ed  by  decides. t would  have  touched  wire  tun 
if  it  had  been  placed  at  statutory  height).  Rim  we  Note.  29  Texas  L.  Rev.  691  (1931) 
1* Davis  v.  tk^gw  22  MU.  497,  \>)  Tie.  US  (1921);  Johmtotv  v.  Southern  Tie. 
Co.  105  Cab  App.  $10,  P*c  81  M3 1C)  a,  Faust  v.  Phdidclphii  ! R Ry  Co. 
19t  Pi  420.  4$  Ait  329  (ISKfc  Gregory  v.  Sral^id  Ait  bine  Ry.  Co,  M2  Vi.  75<h 
HI  $ F 272  (192.5), 
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Bu  t where  the  automobile  is  driven  into  the  sixtieth  car  of  a freight 
tram  at  an  open  and  unobstructed  crossing  in  the  night  time,  the 
absence  of  an  mi  lighted  warning  sign  could  scarcely  be  the  cause 
of  the  accident.  “[!]£  travelers  on  the  highway  could  not  see  the 
moving  cars  ahead  of  [them],  they  could  not  see  the  si^n.”29 

• A r<fC,Cnt,  ca!c  !l  15  PerhaP*  introduced  a new  technique  for  deal- 
ing wit h the  equal  probability”  problem,  at  least  in  a limited 
type  of  situations.  In  Summers  v.  Tice  ■«  two  members  of  a hunt- 
ing party  fired  simultaneously  ac  game.  A pellet  of  shot  entered 
the  eye  of  a third  member  of  the  patty  whom  the  others  should  have 
known  to  be  dangerously  near  their  line  of  fire.  There  was  no 

n vT  )Vh°Se  ®un  die  Pc!let  «mc.  In  a suit  against 
both,  the  California  court  held  them  both  liable  to  plaintiff.  Yet 

only  one  defendant  caused  the  damage  and  in  the  case  against 
each  it  is  obvious  that  plaintiif  could  not  prove  this  element  by 
the  greater  probability.  The  result  actually  docs  no  more  violence 
to  tne  equal  probability  rule  than  many  of  the  instances  just  cited 
it  simply  conflicts  with  that  rule  more  dramatically.  The 
court  rankly  faced  this  problem,  pointed  out  that  ordinarily 
defendants  would  be  in  a far  better  position  than  plaintiff  to 
offer  evidence  as  to  cause,  and  concluded,  “When  wc  consider  the 
relative  position  of  the  parties  and  the  results  that  would  flow  if 
■ p.aintiff  was  required  to  pin  the  injury  on  one  of  the  defendants 
requirement  that  the  burden  of  proof  on  that  subject  be 
shifted  to  defendants  becomes  manifest."  =-:  This  is  like  the  tech- 
nique used  by  the  same  court  in  the  res  ipsa  loquitur  case  of 
Ybarra  v,  Spangard 23  but  has  even  greater  claim  to  acceptance 

h^naTc"  '**&"  ap?r<WCl'  of  a tr3in  wa*  w.Vn^wThcplaintiff  until  both 
wL  a)m  , e C[OM‘"g-  * ’!  ,rain  h3vi,'S  failed  to  signal  until  it 

, 2 up°n  tile  crossing,  and  the  noise  of  the  belated  signal  instead  of  alert- 

w ^bir''nfT  ”U,ed  his  5niur)-  Eberts  v.  Chicago  1 N.W.R.  Co.. 

1 (pfatnttflis  horses  frightened  by  train’s  signal);  Waiting  v.  Central 

f °J ,M-.  82  N.J.L.  506,  81  Atl.  937  (1911)  scmble);  Loui  ana  ft  Ik  Ky  Z. 
v.  Ntx.  91  Ark.  270,  126  S.W.  1076  (1910)  (scmble).  Y 

or  J*Zyih:'l'-hC  e"sine  u,aS  not  vi5ible  or  audlb,e  “ C-S-  » it  is  diesel-powered 
of  rhr  . tn  ,8  1'-  e"Sll>-  the  jury  will  be  allowed  to  find  the  failure 

1 cSfunT c T"?  3 'Va^,ng  t0  bC  thC  Pr°xiwnte  Clu'e  of  1 *a»h  occurring  at 
7°3  (1913)  SCC  tjlu ''"gton-Rock  Island  R.  Co.  v.  Ellison,  110  Tex.  353,  167  S.IV.2d 

fJnrTn  “ ^ N OR,R  V-  ComPtcm'  135  Tex.  7,  IS,  136  S.W.2d  1U3,  1115  (1910), 

Sr;™, fn’mio,0'""'  *"  T““  ‘'W”“ 

21 33  Cal.  2d  SO.  199  P.2d  1 (1913). 

22  35  Cal.  2d  at  SG,  199  T.2d  at  4. 

M25  Cal  2d  4S6,  154  P.2d  637  (1911),  93  Cal.  Aw,.  43,  205  l’ .2d  4-13  (1919)  in 
th.s  case  dtseussed  in  §19.7  supra,  and  James,  Proof  of  U.e  Breach  In  Xcg^ence 

So  nur'e  1 1 " I*?’ V?*  tbC  Pb!mi,r  4,:cd  l!‘=  four  doctor?  and 

for  a trail' nV'  - fndcd  ^im  v'  u'c  ,vas  undergoing  appendectomy  for  damages 
for  a traumatic  tnjury  to  his  shoulder  sufiered  »!.ifc  he  teas  unconscious.  The 
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here  where  both  defendants  have  been  shown  negligent.-4  The 
alternative  would  turn  the  innocent  victim  away  without  redness 
and  exonerate  two  admitted  wrongdoers,  one  of  whom  clearly 
caused  the  harm.  Both  the  fault  principle  and  the  objective  of 
compensation  are  flagrantly  violated  by  any  such  result,  and  it 
would  be  a sterile  and  rigid  procedure  indeed  that  would  force  it 
in  the  tcetli  of  all  the  policies  of  the  substantive  lav/.-5  If  wic 
reasoning  of  the  California  court  is  accepted,  interesting  questions 
will  come  up  as  to  its  limits:  Will  it  apply  10  cases  where  one  or 
more  of  the  wrongdoers  is  not  before  the  court  as  a defendant? 
Will  it  cover  the  case  where  each  of  several  defendants  has  caused 
some  harm  but  it  cannot  be  told  how  much?23 

In  this  matter  of  proof  of  causal  relations,  as  in  other  issues, 
qualified  opinion  evidence  may  be  received  in  appropriate  cases.27 

‘ In  connection  with  it  two  questions  of  sufficiency  may  arise.  If  in 
any  given  case  the  question  of  causal  relation  is  so  esoteric  that  lay 
minds  cannot  form  any  intelligent  judgment  about  it  without 

court  said  that  under  the  circumstances  “all  tho?c  defendants  who  hnd  any  control 
over  hii  body  or  the  instrumentalities  which  might  have  <aured  the  injury  may 
properly  be  called  upon  to  meet  the  inference  of  negligence  by  giving  an  explana- 
tion of  their  conduct.'*  25  Cal.  2d  at  -JO r.  151  P.2d  at  691. 

24  Seavey  says  he  knows  of  no  other  case  *'in  which  it  has  been  held  that  the 
plaintiff  has  sustained  the  burden  of  proof  where  there  was  no  evidence  that  it  was 
more  likely  than  not  that  a defendant  caused  the  plnhuifFs  harm.'  Scavcy,  Res 
Ipsa  Loquitur,  Tabula  in  Xaufragio,  63  Hnrv.  L.  Rev.  613,  G4S  (1950).  On  the 
ground  stated  in  the  text,  however,  he  feels  that  the  Summers  case  can  be  more 
readily  supported  than  the  Ybnna  case.  Ibid.  Any  such  distinction  rests,  of  course, 
on  accepting  the  importance  of  fault  ;r>  a criterion  of  liability.  The  reasoning  in 

‘the  Summers  ease  would  probably  have  been  approved  by  such  authorities  as  Wig* 
more  and  Carpenter.  See  Note,  17  111.  L.  Rev.  45$  (1923);  Carpenter,  Workable 
Rules  for  Determining  Proximate  Cau5e,  20  Calif.  I,.  Rev.  229,  39G,  405  (1952). 

Other  cases  have  reached  a like  result  where  the  plaintiff  was  not  a member  of 
the  party,  on  a theory  of  concerted  action  or  joint  enterprise.  Benson  v.  Ross,  143 
Mich.  452,  106  N.W.  1120  (1903);  Oliver  v.  Miles,  144  Miss.  852,  1 10  So.  GC6  (1927); 
cf.  State  v.  New  berg,  120  Ore.  5G4,  273  Tnc,  503  (1929);  Regina  v.  Salmon,  6 Q.B  D. 
79  (1830). 

25  It  should  never  be  forgotten  that  procedure  should  be  the  handmaiden,  not  the 
mistress,  of  justice.  This  docs  not  mean,  o?  course,  that  the  fundamental  guarantee* 
of  procedural  due  process  are  ret  vitally  important.  They  are.  But  an  observance 
of  them  will  always  mean  that  rights  and  liabilities  under  the  substantive  law  will 
in  sorac  eases  be  unenforceable.  And  that  is  a cost,  vot  cn  advantage,  of  these 
guarantees.  Whenever  n procedural  ride  will  consistently  v.*ork  to  subvert  substan- 
tive justice,  it  should  be  scrutinized  to  see  whether  any  fundamental  guarantee  re- 
quires the  rule  to  persist  in  such  a form.  Here  clearly  it  dots  not.  Even  granting 
sacredness  to  the  fault  principle,  there  arc  many  instances  where  procedural  burdens 
have  been  shifted  by  statute  or  judicial  ruling  on  the  basis  of  considerations  like 
those  present  here.  Res  ipsa  loquitur  and  the  presumption  against  bailees  furnbh 
ready  examples.  Var  a narrow  view  of  the  Su mmeis  ease,  however,  see  Noic,  37 
Cco.  L.J.  627  (1948). 

25  See  the  ooinion  of  the  court  in  the  Ybarra  ease,  25  Cal.  2d  4S5,  491,  154  PdM 
637,  691  (1944). 

27  See  note  29  infra  for  some  recent  cases  in  which  it  has  played  a significant  role 
in  the  cause  issue. 
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expert  aid,  opinion  evidence  from  qualified  witnesses  may  be  re- 
quired as  it  sometimes  is  on  the  question  of  appropriate  standards 
of  conduct.29  Probably  no  more  crystallized  rule  than  this  can  be 
formulated.  "’The  courts  have  proceeded  on  a case  to  case  basis, 
and  there  are  few  decisions  denying  recovery  which  have  laid  stress 
on  the  want  of  expert  evidence  as  to  the  cause  in  fact  of  the  acci-^/ 
dent,?9(  It  should  be  noted  that  where  the  doctrine  of  res  ipsa 
loquitur  is  applied  on  the  basis  of  judicial  notice,  expert  evidence 
is  dispensed  with  upon  this  issue  as  upon  any  other.*0'  Expert 
evidence  is  often  required  to  establish  the  causal  connection  be- 
tween the  accident  and  some  item  of  physical  or  mental  injury01 
unless  the  connection  is  a kind  which  would  be  obvious  to  laymen, 
such  as  a broken  leg  from  being  struck  by  an  automobile. 

Where  opinion  evidence  is  relied  on  to  establish  the  causal  link 
a question  of  sufficiency  may  also  arise  in  deciding  whether  the 
opinion  has  been  expressed  in  terms  definite  enough*  to  support 
the  needed  conclusion,  If  a qualified  witness  gives  it  as  his 
opinion  that  the  causal  relation  did  exist  or  probably  did  exist, 
there  is  not  much  question  about  tire  legal  sufficiency  of  the  testi- 
mony.5- But  cautious  or  hostile  witnesses  arc  often  unwilling  to 

2B  See  Morris,  The  Role  of  Expci  t Testimony  in  the  Trial  of  Negligence  Issues, 

26  Texas  L.  Rev.  1,  10  et  scq.  (1017). 

20  Two  such  cases  are  Rlarjcske  v.  Thompson’s  Restaurant  Co.,  325  lit.  Ap p.  189, 

59  NT.2d  320  (1915)  (expert  was  not  asked  his  opinion  as  to  whether  condition  of 
meat  caused  plaintiff’s  illness;  held , question  of  proximate  cause  left  in  realm  of 
ipccuhtion,  surmise  and  conjecture  e\en  though  nearly  all  other  evidence  pointed 
very  strongly  to  a causal  relation),  and  Goodwin  v.  Misticos,  207  Miss.  361,  42  So. 

2d  397,  402  (1919).  (“The  fact  whether  the  matter  vomited  up  by  Mr.  Goodwin 
contained  corned  beef  that  seas  infected  by  poisonous  bacteria  v:as  also  or.c  capable 
of  direct  and  demonstrative  proof  by  a chemical  analysis”).  There  are  several  recent 
cases  in  which  such  evidence  was  offered  where  the  court  might  well  have  found 
the  lack  of  it  fatal.  See  Foley  v.  Pittsbuigh-Dcs  Moines  Co.,  303  Pa,  1,  68  A.2d 
617  (1919)  (expert  traced  explosion  of  liquefied  natural  gas  to  negligence  of  manu- 
facturer of  tank  in  which  it  was  stored),  and  National  bead  Co.  v.  Schuft,  176  F.2d 
610  (Sth  Cir.  1949)  (experts  ascribed  puffs  of  (lame  and  explosions  to  large  amounts 
of  floating  humus  dust  in  the  air).  Ct.  Western  Tel.  Corp.  v.  McCann,  128  Tex.  5S2, 

99  $.W.2d  S95  (1937)  (defendant’s  experts  successfully  disproved  any  causal  con- 
nection betv/cen  a stroke  of  lightning  coupled  with  the  admitted  negligence  of  the 
defendant  in  the  management  of  its  wires  and  the  death  of  plaintiff's  wife  by 
electrocution). 

In  Comeau  v.  Beck,  319  Mass.  17,  64  N.E.2d  430  (1910),  it  was  the  testimony  of 
the  defendant's  medical  expert  to  the  effect  that  a miscarriage  might  be  produced 
by  "'some  injury,  [or  by]  the  striking  of  the  abdomen/  [which]  was  enough  to 
support  a finding  that  the  plaintiff’s  imscairiage  was  causally  related  to  the  acd* 
dent,” 

30 See  §19.5  supra;  Janies,  Proof  of  the  Breach  in  Negligence  Cases,  37  Va.  L. 

Rev.  179,  197  n 53,  201,  202  (1951).  Cf.  Judson  v.  Giant  Powder  Co.,  107  Cal.  549, 

40  Pae.  1020,  29  L.R.A.  71S,  18  Am . St.  Rep.  146  (1S93),  and  the  discussion  oE  this 
point  in  Morris,  Res  Ipsa  Loquitur  in  Texas,  26  Texas  L.  Rev.  237,  261-262 
(194S). 

31  See  cases  cited  in  note  33  infra. 

32  TuUgien  v.  Amoshcag  Mfg.  Co.,  $2  N.H.  2GS,  133  Atb  4 (C92G);  National  Lead 
Co.  v.  Schuft,  176  F.2d  610  (3th  Cir.  1949);  Avcrdicck  v.  Harris,  S7  Cal.  App.  620,  262 
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go  so  far,  and  there  are  serious  questions  and  variant  rulings  about 
the  sufficiency  of  such  expressions  as  “may  have  cau<<\i.M  “possibly 
caused, M and  the  likc/1jy  Of  course  other  factors  in  the  case  may 
lend  support  to  the  inference.  Thus  in  a nii.'-caniage  ease  the 
Massachusetts  court  has  held  such  an  expression  sufficient  when 
“taken  in  connection  with  the  plaimills  testimony  dm  [her] 
health  was  good  before  the  accident’*  though  otherwise  It  would 
leave  “the  issue  treml.ding  in  the  balance/* 31 
A distinction  should  be  emphasized  at  this  point.  We  are  con- 
cerned here  with  the  sufficiency , not  i he  weight  of  the  evidence, 
and  on  familiar  principles  plaintiff  is  entitled  to  have  sufficiency 
tested  on  the  basis  of  the  evidence  most  favorable  to  him.35''  ft  is 

Tac.  423  (1927);  Shepard  v.  Ciriiaiion  Miik  Co..  220  Iowa  *(’S,  262  X.W.  110  (1935). 

In  many  ca*cs  each  side  has  at  lean  one  expert  witness  who  testifies  concerning 
the  cause  of  the  injury  in  definite,  untquhoc.d  language.  The  experts  thus  dia- 
metrically opposed,  a jury  verdict  for  either  parly  will  be  upheld.  Sec  Karris  v. 
Wood,  214  Minn.  492,  S N.W.2d  SIS  (19 iJ)  (strong  txpcit  testimony  to  support  two 
alternative  theories  ns  to  death  of  dee cilcm.’ only  one  of  which  involved  defendant’s 
negligence;  jury  verdict  for  phinihl  upheld),  discussed  in  Morris,  Pa  xitn me  Cause 
in  Minnesota,  34  Minn.  H.  He.’.  1.V5.  137  (1959). 

33 ''Possibility”  of  causal  relation  iiisrdfincn*:  Mutual  Hen.  Health  Accident 
A«n.  v.  Johnson,  ISC  So.  297  (Mi«s.  1939);  Wood  v.  W.  E.  Joyce  Co.,  22$  Anp.  Div. 
729,  239  N.Y.  Sitpp.  110  (1939);  Hainan  v.  \cv:  England  Tcl/fc  Tel  Co.,  290  Mass. 
219,  5 N.E.Sd  209  (1930);  Gerber  v.  Wltn.'cm;<k),  l>$  ^ . JO 4,  2;5  NAV.  200  (1925). 
In  Western  Truck  Hines  v.  Hen),  53  Arul  215,  37  l‘.2d  4S4,  -186  (19.59)  testimony 
that  ‘one  might  . . , say  that  an  accident  . . . may  or  may  not  have  any  tiling  to 
do  with  it"  was  held  “entirely  too  indefinite  and  .murrain"  to  establish  the' requisite 
causal  relation.  A similar  result  was  reached  in  Di  Fa  do  v.  J.  G.  Hi  ill  Co.,  133 
Pa.  Super.  576,  3 A 2d  216  (195$),  the  court  sluing  rhat  expert  tc-aimony  that  a 
causal  connection  was  “extremely  possible  and  like!)’*  would  be  insufficient. 

Of  course  if  the  cause  and  tc  tilt  ichtinnship  of  the  Upe  which  the  phimilf  is 
urging  as  the  ground  of  the  defendant  liability  ocaits  in  only  one  out  of  one 
hundred  eases  there  is  insufficient  evidence  of  ii  to  go  to  the  jury  in  the  individual 
case  at  bar.  Krnnier  Service  v,  Wiikens.  lot  Mbs.  453,  ISO  So.  625  (1939).  But  a simi- 
lar result  was  reached  in  llowLy  v.  Kantor,  195  Vt.  125,  1(53  All.  625  (1933),  even 
though  the  percentage*  great!)  favored  (he  pUinlhT  (eighty  per  cent  of  cares  in- 
volved the  more  serrous  type  of  tumor  for  which  she  was  ashing  damages)  on  the 
grounds  that  the  cxpcit  testimony  was  not  dhtUed  to  the  likelihood  of°  the  plain- 
tiffs particular  turner  being  of  the  more  serious  type. 

On  the  other  hand  in  Corrd nooks  v.  Icwmual  Harbor  Shops,  Inc.,  252  X.Y.  217, 
26  IS.E.2d  25  (1910),  expert  evidence  (hat  a bn i bur's  electric  vibrator  "would  have 
been  a competent  producing  cause"  of  a detachment  of  tire  retina  (together  with 
absence  of  proof  of  other  piobab.c  cause)  was  held  to  satisfy  the  requirement  as  to 
probability.  The  Appellate  Division  hud  fk  nvi  this  evidence  insufficient  to  estab- 
lish the  causal  connection.  Z53  App.  Div.  522,  8 N.Y.S  2d  171  (1035).  The  case 
of  Wood  v.  W.  E.  Joyce  Co.,  jrrjnc,  aL-o  inuTed  a claim  of  traumatic  detachment 
of  the  retina,  with  the  elimination  of  tuber  probable  causes. 

In  Dunham  v.  Village  of  Caniitco,  503  NY  4 VS,  101  NT. 2d  S72  (1952),  a doctor’s 
testimony  that  “it  was  a icascuublc  medical  cutainLy  that  the  delay  of  eighteen 
hours  [in  providing  mcJicil  aid]  contributed  to  Mr . Dunhams  death"  was  held 
enough  to  establish  a causal  relation,  Mlhuugh  lire  doctor  aUo  admitted  that  the 
same  result  might  have  happened  if  'hole  hud  been  prompt  medical  care. 

34  Comcau  v.  Beck,  319  Masw  17,  20.  Cl  NX  2d  436,  437  (I9!G);  ci.  De  Filippo’s 
Case,  284  Mass.  531,  534,  155  NT.  245,  217  (1933). 

35  0 Wigmore,  Evidence  $$2191,  2-95  (Jd  cd.  1240). 
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only  to  that  version  of  the  evidence  therefore  that  the  equal  proba- 
bility test  (of  the  legitimacy  of  an  inference)  may  properly  be 
applied.  It  is  the  jury’s  function,  not  the  court’s,  to  say  whether 
plaintiff's  version  of  the  case  is  sustained  by  the  greater  weight  of 
the  evidence.3*  A case  therefore  in  which  the  evidence  most  favor- 
able to  plaintiff  (including  the  opinion  of  qualified  experts)  affords 
the  basis  for  an  inference  of  cause  is  not  defeated  as  matter  of  law 
by  defendant’s  evidence  of  a different  version  of  facts  or  a different 
theory  of  cause  in  fact.37  Where  the  evidence  stands  in  that  pos- 
ture, the  question  of  what  probably  happened  in  the  case  is  for  the 
jury  even  though  the  court  might  be  inclined  to  accept  the  de- 
fendant’s version. 3S 

There  is  another  type  of  question  that  presents  problems  of 
proof  of  fact,  which  is  sometimes  treated  under  the  heading  of 
cause.39  This  is  the  question  found  in  negligent  entrustment  and 
unlicensed  operator  cases,  namely,  whether,  in  the  former,  the  fact 
of  the  driver’s  incompetence  can  be  vised  as  evidence  that  the  ac- 
cident occurred  as  a result  of  this  incompetence,  and,  in  the 
latter  cases,  whether  the  lack  of  license  is  evidence  that  the  driver 
was  unskillful  in  a way  which  contributed  to  the  accident.  De- 
ss Id.  §2551. 

37  Unless,  in  any  given  case,  the  defendant's  evidence  is  so  overwhelming  that 
under  general  principles  reasonable  minds  could  not  reject  it  and  accept  the 
plaintiffs. 

33  The  actual  decisions  recognize  this.  National  Lead  Co.  v.  Schuft,  176  F.2d  610 
(8th  Cir.  1919);  Scars  Roebuck  Sc  Co.  v.  Scroggins,  1*10  F.2d  71S  (8th  Cir.  1914); 
Turner  v.  Minneapolis  St.  Ry.  Co  , HO  Minn.  243,  107  N.W.  10U  (191S);  Tuitgrcn 
v.  Amoskcag  MEg.  Co.,  $2  NIL  263,  153  Atl.  4,  40  A.L.R.  3S0  (192G);  Foley  v.  Pitts- 
burgh-Dcs  Moines  Co.,  SG3  Pa.  1,  6S  A. 2d  517  (1919).  But  the  reasoning  in  these 
cases  sometimes  fails  to  describe  with  accuracy  the  basis  for  hoMing  inapplicable  the 
rule  that  a jury  will  not  be  allowed  to  guess  between  two  equally  probable  causes. 
Thus  in  the  Tullgrcu  case  the  court  says  this  equal  probability  rule  “has  no  applica- 
tion unless  the  existence  of  a sufficient  cause  or  causes  for  the  injury  aside  from  the 
negligence  charged,  is  conceded  or  conclusively  proved/*  155  Atl.  at  7.  This  state- 
ment overlooks  the  fnct  that  it  is  the  weakness  of  pLuntifJ's  case  that  usually  brings 
the  restrictive  rule  into  play  and  seems  to  suggest  instead  that  a plaimifF  can  get  to 
the  jury  whenever  defendant's  explanation  falls  shoit  of  certainty.  Unfortunately, 
perhaps,  the  broad  literal  implications  of  this  statement  do  not  seem  to  be  borne  out 
by  the  cases.  See,  however.  Carpenter,  Workable  Rulc>  for  Determining  Proximate 
Cause,  20  Calif.  1..  Rev.  39G,  399  (1952).  Possibly  the  court  in  the  Tullgrcn  case 
o:  ly  meant  to  say  that  “where  a cause  is  present  which  might  produce  a condition 
that  has  occurred,  and  there  is  no  other  adequate  cause  proved,  the  matter  is-  not 
Krthin  the  equipoise  rule,  and  it  may,  under  such  conditions  be  found  that  the 
only  thing  suggested  by  the  evidence  which  could  cause  an  effect  found  present, 
did  cause  it.”  Turner  v,  Hartford  F.  Ins.  Co.,  I S3  row  a 1303,  1377,  172  N.W.  ICG, 
172  (1919).  As  v;e  have  seen,  nearly  all  the  cases  which  employ  even  this  relaxation 
of  the  equal  probability  formula  could  with  equal  logic  have  been  decided  for  the 
defendant  because  plaintiffs  proof  of  cause  rc>ted  on  conjecture.  For  examples,  sec 
cases  cited  in  notes  16,  17  and  19  supra. 

33  We  shall  deal  with  the  problem  of  proof  litre,  ami  leave  for  later  discussion  the 
problem  of  whether  the  question  should  be  asdmihtcd  to  the  issue  of  cause  or  the 
issue  of  duty.  See  §20.5,  notci  15-17  infra. 
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fendant  entrusts  Ms  automobile  to  one  lie  should  know  is  inconi, 
petent  and  the  latter  injures  plaintiff,  a lawful  user  of  the  high- 
way. Under  these  circumstances  the  owner's  act  is  clearly  a breach 
of  duty  towards  plaintiff.  But  there  remains  a further  question 
before  liability  may  be  imposed,  This  may  be  put  in  terms  of 
causal  relation  between  the  negligence  and  the  harm  or  in  terms 
of  whether  the  injury  arose  out  of  the  risk  which  made  defendant’s 
entrustment  negligent.  In  either  event  the  factual  inquiry  is  the 
same,  did  the  injury  occur  because  the  driver  (borrower)  teas  in- 
competent? Neither  analysis  will  permit  recovery  if  the  driver 
was  acting  with  all  due  care  on  this  occasion.  The  result  is  that 
the  negligence  of  the  driver  must  be  proven  (just  as  in  a case  of 
vicarious  liability,  but  for  different  reasons).  The  only  question 
is  whether  proof  of  his  incompetence  which  was  introduced  to  ✓ 

show  the  owner’s  negligence  may  also  be  used  as  evidence  of  negli- 
gent operation  in  the  particular  accident  in  the  case.  The  courts 
have  said  not,  since  on  grounds  of  doubtful  validity  the  evidence 
would  have  been  incompetent  if  offered  for  the  latter  purpose.40 

Where  the  car  is  entrusted  to  an  unlicensed  driver,  or  where  * 

defendant  himself  drives  it  without  a license,  a similar  question 
is  presented  (if  the  licensing  statute  is  thought  to  have  safety  in 
mind  and  not  exclusively  revenue,  or  some  other  unrelated  pur- 
pose), namely,  did  the  harm  result  from  the  want  of  a license, 
or  (more  accurately)  because  of  the  risk  of  unskilled  driving  which 
the  licensing  statute  sought  to  prevent?  If  the  car  was  driven 
with  all  the  skill  and  care  the  law  requires,  then  the  harm  did  not 
proceed  from  such  a risk;  if  the  car  was  negligently  driven,  then 
it  did.  The  further  question  then  arises:  may  the  want  of  license 
be  considered  as  some  evidence  that  unsk:llfulness  contributed  to 
the  accident?  Some  courts  say  no.41  But  this  effectively  deprives 
the  victim  of  any  benefit,  in  a civil  case,  of  a statute  concedcdly 
passed  in  part  for  his  protection.  Nor  docs  the  ruling  seem 

4°  In  nearly  all  jurisdictions  the  character  or  propensity  for  carelessness  of  a per- 
son charged  with  a negligent  act  may  not  be  put  in  evidence  even  though  it  might 
throw  light  “on  the  probability  of  his  having  acted  carelessly  on  the  occasion  in 
question."  See  1 Wigmorc,  Evidence  £03  (3d  cc.  1010).  This  matter  is  discussed  in 
James  and  Dickinson,  Accident  Pror.cness  and  Accident  taw,  G3  Haev.  L.  Rev.  769, 

791  (1950). 

41  Mahowald  v.  Bcckrich,  212  Minn.  7S,  2 N.W.2J  560  (19x2),  lists  several  of  the 
jurisdictions  in  which  both  questions  in  the  text  have  been  answered  in  the  nega- 
tive. See  also  Morrison  v.  Lc  Toutiu.au,  158  F.2d  339  (5th  Cir.  1913):  "There  is 
argument  that  the  possession  of  only  a pilots  license  is  evidence,  or  the  basis  of  a, 
presumption  that  the  pojsessor  is  larking  in  skill  as  a [commercial]  pilot.  This 
does  not  ncctssarily  follow.  He  miy  net  have  chosen  to  take  further  examinations 
although  he  might  have  been  abundantly  qualified  for  commercial  or  transport  li* 
cense  for  aught  the  record  shows.  . . ."  13S  F.2d  at  311  (emphasis  ad  Jed).  UT.y 
docs  the  court  insist  on  talking  about  possihililic s which  are  patently  less  than  proba- 
ble? Smith  v.  Whitley,  223  N.C.  334,  27  ST. 2d  412  (1913),  is  similar.  The  classic 
opinion  is  that  in  Brown  v.  Shjne,  212  NX  176,  151  NT.  197  (1026). 
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theoretically  sound.  The  legislature  has  decided  that  the  general 
safety  requires  the  activity  in  question  to  be  limited  to  those  who 
can  demonstrate  at  least  a minimum  of  special  skill.  It  is  fair  to 
assume  that  the  unlicensed  as  a class  are  far  less  likely  than  those 
licensed  to  have  the  skill  for  which  a license  is  required.  And  it  is 
certainly  true  that  when  the  unskilled  attempt  what  it  takes  skill  to 
dor  some  of  the  intangible  factors  that  go  to  make  up  lack  of  skill 
are  far  move  likely  than  not  to  have  contributed  to  any  mishap 
that  occurs.  Frequently  they  do  this  in  ways  that  arc  hard  to 
prove;  moreover,  what  evidence  there  is  in  the  matter  is  likely  to 
be  in  the  defendant’s  hands.  Both  probability  and  policy,  there- 
fore, call  for  the  rule  that  breach  of  a licensing  statute,  if  it  is 
negligence  to  the  plaintifl  at  all,  should  be  prima  facie  evidence 
that  it  is  the  ‘‘proximate  cause”  of  any  injury  that  ensues.4- 


42  Cf.  Martin  v.  Herzog,  22S  N.Y.  104,  126  N.F..  S 14  (1920)  (fact  that  unlighted 
vehicle  was  struct;  by  another  in  the  dark  is  evidence  that  the  collision  occurred  be* 
cause  of  the  lack  of  lights).  Dean  Green  agrees  with  the  position  stated  in  the  text. 
See  discussion  of  Mundy  v.  Pirie-Slaughtcr  Motor  Co.,  146  Tex.  314,  206  S,\V.2d 
5S7  (1947),  in  Green,  Proximate  Cause  in  Texas  Negligence  Law,  28  Texas  L.  Rev. 
755,  7CG  (1930),  and  see  Green,  Proximate  Cause  in  Connecticut  Negligence  Law,  24 
Conn.  BJ.  24,  32  (1930).  If  the  rule  suggested  here  should  emerge,  it  might  well  be 
qualified  by  holding  that  the  Inference  of  negligence  from  want  of  a license  is 
rebutted  by  a showing  that  the  actor  was  qualified  for  a license,  so  far  as  skill 
was  concerned. 

The  lack  of  registration  foi  an  automobile  Involved  in  an  accident  evokes  varied 
responses  from  different  courts.  In  the  great  majority  of  jurisdictions  the  driver 
of  an  unregistered  automobile  is  not  liable  to  those  he  injures  if  he  is  otherwise 
exercising  due  care.  Gilman  v.  Central  V.  Ry.  Co.,  93  Vt.  3*10,  107  Atl.  122  (1919); 
Armstrong  v.  Sellers,  1S2  Ala.  532,  62  So.  26  (1913).  See  annotation,  163  A.L.R.  1375 
(1946)  (citing  earlier  annotations),  and  cases  cited  by  Carpenter,  Workable  Rules  for 
Determining  Proximate  Cause,  20  Calif.  L.  Rev.  390.  410  n,J33  (1932).  This  result 
should  be  reached  not,  as  many  courts  reason,  because  the  violation  of  the  statute 
is  not  the  proximate  cause  of  the  injuiy  (see,  eg.,  Gonchar  v.  Kelson,  114  Conn.  262. 
153  Ail.  545  (1932)),  but  because  the  plaintiff  is  not  within  the  scope  of  duty  owed 
by  the  defendant  with  respect  to  registration.  See  discussion  ct  the  Gonchar  case  by 
Green,  Proximate  Cause  in  Connecticut  Negligence  law,  24  Conn.  3.J.  24,  28-30 
(1950).  See  further  discussion  of  this  point  in  the  text,  $20.5  at  note  IS  infra. 

Massachusetts  is  probably  the  only  state  which  rttll  considers  the  unregistered  ve- 
hicle an  outlaw  or  trespasser  on  the  highway  to  which  no  duty  is  owed,  Chase  v.  N.Y. 
Cent,  k H.R.R.  Co.,  203  Ma«s.  137,  91  NT.  377  (1911),  and  which  is  responsible  for 
all  harm,  regardless  of  negligence.  McDonald  v.  Dilation,  242  M-ss.  229,  ISO  N.E. 
264  (1922).  For  sonic  of  the  ridiculous  bases  upon  which  an  automobile  has  been 
found  to  be  unregistered  in  Massachusetts,  see  Comment,  10  U U.L.  Rev.  211  (1930). 
But  Massachusetts  has  apparently  come  to  the  point  where  it  is  at  least  unwilling 
to  extend  this  rule  to  new  situations.  See  Galbraith  v.  Levin,  323  Mass.  255,  31 
N.E2d  560,  563  (194$),  in  which  (he  owner  was  held  not  liable  for  injuries  inflicted 
by  a thief  who  was  negligently  driving  his  car  which  was  unregistered  and  Ir  which, 
in  violation  of  i .statute,  he  had  left'  the  keys.  This  decision  directly  overruled, 
Malloy  v.  Newman,  310  Ma*i.  269,  37  N.E  2d  1031  (1911),  which  involved  very  similar 
facts  and  which  rested  heavily  on  the  fact  that  the  automobile  was  unregistered. 


CORRELATION  AND  CAUSATION 


The  discussion  by  Mueller  and  Schuessler  concerning  cor- 
relation notes  that  correlation  does  not  necessarily  indicate 
causation.  At  the  same  time,  it  is  true  that  all  observers  of 
human  behavior,  including  lawyers  and  social  scientists,  engage 
in  the  analysis  of  causal  relationships  and  depend  upon  correlation 
to  do  so.  And  correlations  can  lead  to  inferences  about  the 
operation  of  causation — although  they  cannot  prove  it  in  a final 
sense.  The  task,  then,  is  to  place  correlation  in  its  proper 
perspective  with  regard  to  causation;  and  we  feel  that  the  follow- 
ing reading  provides  an  excellent  discussion  of  the  problem. 


From  An  Introduction  to  Logic  and  Scientific 
Method  by  Morris  R.  Cohen  and  Ernest  Nagel,  copyright, 
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§ 1.  TYPES  OP  INVARIANT  RELATtONS 


The  search  for  order  among  facts  Is  a difficult  task.  Few  suc- 
ceed in  it.  But  it  has  always  been  the  hope  of  some  thinkers  that 
easily  learned  rules  might  be  found  according  to  which  anyone 
undertaking  such  a task  may  be  assured  that  success  will  crown  his 
efforts.  And  some  writers  on  scientific  method  have  proudly  be- 
lieved they  had  actually  found  such  rules.  Francis  Bacon  was  one 
of  them.  “Our  method  of  discovering  the  Sciences/'  he  wrote,  “is 
such  as  to  leave  little  to  the  sharpness  and  strength  of  men's  svits, 
but  to  bring  all  wits  and  intellects  nearly  to  a level.  For  as  in  draw- 
ing  a straight  line,  or  describing  an  accurate  circle  by  the  unas- 
sisted hand,  much  depends  on  its  steadiness  and  practice,  but  if  a 
rule  or  a pair  of  compasses  be  applied,  little  or  nothing  depends 
upon  them,  so  exactly  is  it  with  our  method/' 1 The  methods  which 
Bacon  recommended  for  discovering  the  causes  of  things  are  popu- 
larly believed  to  express  the  nature  of  scientific  method.  They  were 
elaborated  by  John  Stuart  Mill  and  formulated  by  him  as  the 
methods  of  experimental  inquiry. 

Before  examining  these  methods,  let  us  note  some  preliminary 
difficulties.  In  Chapter  XI  we  tried  to  indicate  that  isolated  facts 
do  not  constitute  a science,  and  that  the  object  of  science  is  to  find 
the  order  among  facts.  But  what  kind  of  order?  It  is  commonly 
believed  that  science  is  interested  exclusively  in  causal  order.  The 
analysis  of  the  meaning  of  “causality”  is  a most  difficult  task.  We 
cannot  enter  upon  it  here,  for  it  is  not  a task  for  die  logician.  But 
we  must  observe  that  various  kinds  of  order  are  sometimes  con- 
founded as  identical  with  the  causal  order.  We  may  dien  be  pre- 
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pared  to  state  the  general  character  of  the  sort  of  order  for  which 
the  sciences  are  in  search. 

1.  There  is  a type  of  order  so  familiar  that  it  generally  escapes 
notice.  AH  of  us  recognize  certain  things  as  water,  other  things  as 
wood,  still  others  as  steel,  and  so  on.  Why  do  we  assign  special 
names  such  as  “water,"  "wood  " and  "steel"? 

We  apply  the  name  "water"  to  something  which  is  a liquid  when 
above  a definite  temperature,  a vapor  when  above  another  tempera* 
ture.  This  "substance"  is  generally  translucent,  odorless,  colorless; 
it  has  a constant  density,  and  is  practically  incompressible;  it 
quenches  fire  and  thirst.  "Water"  denotes  a constant  conjunction  of 
properties , and  a name  is  given  to  it  in  order  to  distinguish  it  from, 
other  such  conjunctions  or  "things."  So  also  for  "wood"  and  "steel." 

The  vague  concept  of  "thing"  denotes,  therefore,  a vc ry  elemen- 
tary but  fundamental  type  of  order.  It  denotes  a certain  invariable 
conjunction  or  association  of  properties  that  is  different  from  other 
conjunctions.  Such  a type  of  order  would  probably  never  be  called 
"causal."  But  the  discovery  of  tins  kind  of  order  is  fundamental 
for  the  discovery  of  any  other  kind  of  physical  order.  Different  kinds 
of  things  have  been  recognized  in  the  most  primitive  stages  of  man's 
history.  But  the  process  of  classifying  and  cataloguing  our  ex* 
periences  is  not  complete,  and  perhaps  never  will  be. 

2.  -A  type  of  order  frequently  recognized  is  one  involving  a tem- 
poral span  or  direction.  "Iron  rusts  in  moist  air,"  expresses  one  in- 
stance of  such  an  order.  It  is  this  kind  of  order  that  is  generally 
regarded  as  causal. 

The  "common-sense"  notion  of  cause  is  an  interpretation  of 
nonhuman  behavior  in  terms  peculiarly  adopted  to  human  behavior. 
Thus,  "John  broke  the  window,"  is  supposed  to  express  a causal 
relation,  because  there  is  an  agent  "John"  who  produced  the  break- 
ing of  die  window.  So  in  the  proposition  above,  the  moist  air  is 
said  to  be  the  cause,  of  which  the  rusting  iron  is  the  effect.  And  the 
moist  air  is  said  to  be  the  cause  because  it  is  believed  to  produce 
the  rusting.  In  the  popular  mind,  all  changes  require  causes  to  ex- 
plain them,  and  when  found  are  interpreted  as  agents  producing 
the  Change. 

It  is  very  difficult  to  make  clear  what  is  understood  by  causes 
. "producing"  their  effects.  When  we  reflect  on  the  matter,  all  that 
we  seem  capable  of  discovering  in  alleged  instances  of  causality, 
is  an  Instance  of  an  invariable  relation  between  two  or  more  proc* 
esses.  It  is  not  the  mere  existence  of  John  which  is  the  cause  of  the 
broken  window.  What  is  important  is  the  constant  edition  which 
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holds  between  a certain  kind  o£  behavior  of  John  and  a certain 
other  kind  of  behavior  of  the  glass.  One  of  the  cha.  ^ctciistics  of  the 
causal  relation  as  generally  understood  is  that  it  is  asymmetrical 
and  temporal  in  nature. 

But  even  common  sense"  soon  recognizes  chat  the  apparent  in- 
variability of  alleged  causal  relations  is  often  specious.  Iron  does 
not  always*  rust  in  moist  air,  and  a window  is  not  always  broken 
when  a brick  is  hurled  at  it.  Even  “common  sense'*  discovers  that 
other  factors  must  be  present  in  these  situations  besides  those 
already  noted.  Hence  it  is  not  moist  air  alone  that  is  the  cause  of 
rusting.  Those  other  factors  are  then  sought  for  which  seem  to  be 
necessary  for  tlte  occurrence  of  the  effects.  In  this  way  there  is  a 
gradual  transition  from  tlte  crude  and  approximate  uniformities 
observed  in  everyday*  experience,  to  tlte  more  completely  analyzed 
invariant  relations  of  the  developed  sciences. 

5.  Many  uniformities  are  expressible  by  numerical  equations. 
Ohm’s  law  in  electricity  states  that  the  current  is  equal  to  the  po- 
tential difference  divided  by  the  resistance.  The  principle  of  the 
lever  states  that  equilibrium  is  obtained  when  the  two  weights  vary 
inversely  to  their  distances  from  the  fukvuiu. 

^ Invariable  relations  of  this  type  no  longer  assert  a sequence  in 
time,  and  they  are  probably  never  regarded  as  illustrations  of  causal 
order.  Ic  is  true  that  in  making  experiments  upon  an  electric  cir- 
cuit we^  may  alter  the  current  first  and  subsequently  note  the’ 
change  in  the  potential  difference.  But  what  Ohm's  Izvr  states  is 
not  the  order  in  which  we  make  observations;  it  states  that  the 
measurable  elements  observed  stand  to  each  other  in  the  specified 
invariable  relations. 

fourth  type  of  order  is  illustrated  in  such  comprehensive 
theories  as  the  theory  of  gravitation,  or  the  kinetic  theory  of  matter, 

In  such  theories,  not  all  the  elements  between  which  the  invariant 
relations  are  asserted  to  hold  are  directly  observable.  No r are  all 
the  relations  which  are  asserted  to  hold  between  the  dements  ca. 
pable  of  direct  experimental  confirmation.  Thus  the  atoms,  their 
^motions  and  collisions,  the  invariance  of  their  average  energies, 
cannot  be  verified  directly.  ° * 

The  function  of  such  comprehensive  theories,  as  we  have  already 
seen,  is  that  they  enable  us  to  show  that  many  numerical  as  well  a? 
qualitative  laws  rvhich  are  in  fact  experimentally  confirmable,  are 
not  isolated  from  one  another.  Such  laws  can  often  be  shown  to  be 
the  necessary  consequences  of  the  more  abstract  and  inclusive  order 
asserted  in  the  theory.  Thus  the  numerical  relations  between  the 
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temperature,  the  volume,  and  the  pressure  of  gases;  the  numerical 
laws  connecting  the  density  and  the  specific  heat  of  gases;  the  rela- 
tions between  melting-point,  pressure.  and  volume  of  solids— these 
are  all  derivable  by  logical  methods  from  the  assumptions  of  the 
kinetic  theory  of  matter. 

If  we  examine  these  four  genera!  types  of  order,  we  discover  that 
a generic  feature  of  all  of  them  is  assertion  of  some  kind  of 
invariable  relation  between  various  kinds  of  elements.  The  relation 
in  some  cases  may  involve  a tern  pore  l asymmetry;  the  cause  is 
popularly  said  to  precede  the  cilect  in  time.  In  other  cases  the 
temporal  reference  is  missing,  ft  is  the  iuv  inability  which  seems  to 
be  significant,  both  theoretically  md  practically. 

By  the  cause  of  some  effect  v. •;  ilvil1  understand,  therefore,  some 
appropriate  factor  invariably  nda  ed  to  the  effect.  If  A has  diph- 
theria at  lime  t is  an  effect,  wc  sh  «!l  understand  by  its  cause  a cer- 
tain  change  C,  such  that  the  fol  d ing  holds.  If  C takes  place,  then 
A wall  have  diphtheria  at  tine  \ and  if  C does  not  take  place  A 
will  not  have  diphtheria  at  time  and  this  is  true  for  all  values  of 
A , C , and  l,  where  A is  an  indiv:«h:r.l  t-f  a certain  type,  C an  event 
of  a certain  type,  and  t the  time. 

The  search  for  “causes"  may  .k-refore  be  understood  as  a search 
for  some  invariable  order  between  \arious  sorts  of  elements  or  fac- 
tors. The  specific  nature  of  this  urdsr  will  vary  with  the  nature  of 
the  subject  matter  and  the  purple  of  the  inquiry'.  Moreover,  the 
Specific  nature  of  the  elements  b::w<cn  which  the  order  is  sought 
will  also  differ  for  different  inquiries  In  some  cases  we  already 
know  the  invariable  order  :md  -orn.c  of  *he  elements,  and  then  our 
search  is  for  one  or  more  Fur  the:  chnnont«.  Thus,  finding  a person 
dead  from  wounds  and  knowing  the  conditions  under  which  such 
a death  results,  we  look  for  the  murderer.  In  other  cases,  we  may 
know  the  elements  and  search  for  an  invariable  order  between 
them.  Thus  we  may  note  hot  wittc  b.dng  poured  into  a glass  and 
also  note  that  the  glass  cracks;  we  ir.ay  then  look  for  the  structure 
of  the  relations  connecting  two  such  processes.  In  still  other  cases, 
we  may  notice  some  change  ar.d  dion  look  for  other  as  yet  un- 
known changes  connected  with  it  in  some  ways  not  yet  known. 
Thus  wc  may  note  the  aurora  borealis,  and  then  search  for  the  cir- 
cumstances with  which  it  is  connected  in  some  fashion  or  other. 

The  kind  of  elements  or  changes  for  which  we  look  depends  on 
the  structure  of  the  order  in  which  v.e  are  interested.  The  answer 
to  the  question,  "Who  killed  the  Archduke  Ferdinand  at  Sarajevo?" 
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must  be  of  the  form:  “The  person  or  persons  A,  B,  C and  so  on,  are 
the  assassins  of  the  Archduke.”  On  the  other  hand,  the  question, 
"What  killed  the  Archduke?”  must  be  answered  according  to 
the  kind  of  specific  order  for  which  tee  are  in  search,  and  according 
to  the  purpose  of  the  inquiry.  One  answer  may  be,  "A  certain  re- 
volver was  the  cause  of  his  death."  Other  possible  answers  are: 
"Certain  social  and  political  conditions  were  the  cause  of  his 
death”;  "The  cessation  of  the  oxygen  tupply  to  the  cells  of  his  tody 
was  the  cause  of  his  death."  In  other  svords,  the  kind  of  order,  as 
well  as  the  kinds  of  elements  v.e  look  for,  is  determined  by  the  na- 
ture of  the  problem  svhfch  generates  the  inquiry.  What  is  an  ade- 
quate arvssvcr  to  one  question  will  not,  in  general,  be  adequate  to 
another. 

In  the  light  of  the  great  variety  in  the  kinds  of  specific  orders  and 
fretors  svhich  may  be  the  objects  of  an  inquiry,  it  may  seem  pre- 
posterous to  believe  that  any  general  rules  can  be  stated  svhich 
will  enable  us  to  find  satisfactory  answers  to  all  possible  problems. 
We  shall  not  prejudge  the  matter,  however,  and  shall  examine  at 
length  the  experimental  methods  formulated  by  Mill. 

; | S.  THE  EXPERIMENTAL  METHODS  IN  GENERAL 

The  “experimental  methods,”  according  to  their  author,  have  a 
twofold  function  to  fulfill.  They  arc,  in  the  first  place,  methods  of 
discovering  causal  connections.  Mil!  believed  that  by  emptoying  the 
methods  the  order  in  svhich  facts  stand  to  one  another  can  be 
found.  Against  his  critics  he  maintained  that  all  inferences  from 
experience  are  made  by  means  of  these  methods.  If  “no  discoveries 
were  ever  made  by  the  . . . methods  . . . none  svere  ever  made  by 
observation  and  experiment;  for  assuredly  if  any  svere,  it  svai  by 
processes  reducible  to  one  or  other  of  those  methods.”*  It  is  these 
methods  svhich  supply  the  first  ger.cnlintions  upon  svhich  atl  sub- 
sequent construction  of  hypotheses  depends.’ 

But  secondly,  the  methods  have  a demonstrative  function  as  well. 
Mill  conceived  the  business  of  logic  to  be  concerned  tvith  proof. 
Inductive  logic,  according  to  him.  should  supply  “rules  and  models 
(such  as  the  Syllogism  and  its  rules  are  for  ratiocination)  to  svhich 
If  inductive  arguments  conform,  those  arguments  are  conclusive, 
and  not  othetsvise."  * The  methods  ate  thttefore  to  be  tests  of  any 
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experimental  procedure.  Just  as  the  e%ndence  for  a proposition  is 
conclusive  if  the  relations  between  the  propositions  offered  in  evi- 
dence and  the  one  in  question  conform  to  the  conditions  for  neces- 
sary inference,  so  an  "inductive  argument"  is  valid,  it  is  claimed,  if 
it  conforms  to  the  "experimental  methods/'  The  conclusion  of  in- 
vestigations into  matters  of  fact  could  therefore  be  .absolutely 
certain. 

We  shall  be  able  to  evaluate  these  tv;o  claims  more  clearly  if  we 
state  what  the  general  nature  of  the  methods  is,  and  if  we  recall 
what  are  the  conditions  under  which  an  inquiry  can  be  carried  on. 
In  the  first  place,  some  selected  portion  of  our  experience  is  taken 
for  further  study  because  of  its  problematic  character-  The  prob- 
lem must  then  be  formulated  in  terms  of  the  situation  which  pro- 
voices  the  inquiry,  and  an  analysis  of  die  situation  must  be  made 
into  a certain  number  of  factors,  present  or  absent,  which  are  be- 
lieved to  be  relevant  to  the  solution  of  the  problem,  Now  the  order 
for  which  wc  are  in  search  is  expressible,  as  we  have  seen,  in  the 
form:  C is  invariably  connected  with  E.  And  this  means  that  no 
factor  can  be  regarded  as  a r.'vte  if  it  is  present  white  the  effect  is 
absent,  or  if  it  is  absent  whiV  he  eiTcct  is  present,  or  if  it  varies  in 
tome  manner  while  the  cdcct  do:s  r.ot  vary  in  some  corresponding 
manner.  The  function  of  experiment  i;  to  determine  with  regard 
to  each  of  the  factors  entertained  as  a possible  cause,  whether  it  is 
invauabty  related  to  the  effect.  If  C and  £ are  two  factors  or  proc- 
esses, there  are  four  possible  conjunctions:  we  may  find  either  CE , 
- CE,  5E,  or  EE,  where  C and  E denote  the  absence  of  these  factors. 
To  show  that  C is  invariably  connected  with  E we  must  try  to  show 
that  the  Second  and  third  alternatives  do  not  occur* 

\Vhcn  the  problematic  situation  is  complex,  and  contains  distin- 
guishable factors  as  components,  see  can  establish  an  invariable 
relation  between  die  effect  and  some  of  the  possibly  causal  factors 
only  by  showing  that  such  factors  do  or  do  not  meet  this  formal 
condition  for  invariable  connection.  It  is  necessary,  therefore,  to 
vary  the  supposedly  relevant  factor,  one  at  a time,  and  consequently 
to  anatyre  the  situation  into  factors  that  are  relatively  independent 
of  one  another. 

The  function  of  experiment,  as  we  shall  see,  is  eliminative.  And 
the  methods  of  experimental  inquiry,  as  we  shall  also  see,  have  pre* 
dsely  that  function. 
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§3.  THE  METHOD  OF  AGREEMENT 

The  Method  of  Agreement  os  a Canon,  vf  Discovery 

Whether  we  are  searching  for  the  catiic  of  some  event  or  for  the 
effect,  we  begin  with  a situation  which  can  be  identified  as  one  of  a 
type . Suppose  that  one  morning  we  find  the  flowers  in  the  gardens 
of  a suburban  town  to  have  withered  over  night.  How  shall  we 
proceed  to  find  the  cause? 

The  first  experimental  canon  instructs  us  as  follows:  If  two  or 
more  instances  of  the  phenomenon  uncut  investigation  have  only 
one  circumstance  in  common , the  circumstance  fin  tvAich  clone  all 
the  instances  agree,  is  the  cause  (or  eject)  of  the  given  phenome • 
non.1  The  phenomenon  we  are  investigating  is  the  withering  of 
flowers;  the  withered  flowers  in  the  several  gardens  are  the  instances. 
We  must,  accordingly,  examine  the  instances  for  the  common  cir- 
cumstances or  factors.  We  note  that  the  gardens  differ  in  many 
ways:  in  the  quality  of  the  soil,  in  the  kinds  of  flowers  they  grow, 
in  their  site,  in  their  location,  in  the  character  of  their  gatxleners. 
We  note  atso  that  the  temperature  had  fallen  very  sharply  during 
the  night.  We  conclude  on  the  basis  of  the  canon  that  the  drop  in 
temperature  is  the  cause  of  the  withering  of  the  flowers.  Why  are 
we  justified  in  drawing  this  conclusion?  Why  do  we  not  say  that 
the  quality  of  the  soil  is  the  cause?  Because,  it  may  be  said,  the 
factors  which  are  absent  when  the  phenomenon  is  present  cannot 
be  invariably  connected  with  the  phenomenon:  the  quality  of  the 
soil  is  not  die  same  in  each  instance  of  the  withered  flowers.  Hence 
the  invariable  circumstance  of  the  drop  in  temperature  must  be  the 
cause.  And  indeed  sharp  changes  in  temperature  are  known  to  be 
fatal  to  flowering  plants.  The  canon  seems  to  be  a successful 
method,  therefore,  both  for  discovering  and  for  "proving* causes. 

Unfortunately,  in  this  illustration  we  knew  what  the  cause  of 
the  withering  of  the  flowers  was  prior  to  the  application  of  the 
method.  It  is  nqt  surprising  that  ve  have  been  able  to  discover  the 
cause.  Wo  must  employ  the  canon  to  discover  the  cause  of  some 
phenomenon  for  which  wc  do  not  happen  to  know  dte  cause  ante- 
cedently. 

Now  baldness  in  men  is  a phenomenon  for  which  the  cause  is 
not  known.  If  the  canon  is  an  efleer I*c  instrument  of  discovery,  no 
itudent  of  logic  need  suffer  long  frem  a naked  scalp.  In  accordance 
frith  the  canon,  we  find  two  or  marc  bald  men  and  starch  for  a 
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single  common  circumstance.  But  immediately  we  run  into  great 
difficulties.  The  method  requires  that  the  men  should  differ  in  all 
respects  except  one.  Wc  will  have  rare  luck  indeed  (or  we  ate  per* 
haps  rather  unimaginative  in  noting  common  circumstances)  if  sve 
succeed  in  obtaining  a collection  of  men  who  satisfy  this  condition. 
(If  sve  are  not  too  particular  about  the  nature  of  the  '‘common” 
circumstance,  sve  may  perhaps  hold  that  die  common  circumstance 
to  all  bald  men  is  their  being  organic  bodies.) 

Let  us  waive  this  point.  We  then  meet  a more  serious  obstacle. 
How  shall  sve  go  about  identifying  the  common  circumstance  or 
circumstances?  If  one  specimen  has  blue  eyes,  must  we  examine 
— - all  the  others  for  color  of  eyes?  If  one  of  the  bald  men  confesses  to 
having  been  brought  up  on  cod  liter  oil,  must  sve  discover  whether 
the  others  were  brought  up  the  same  svay?  But  the  number  of  such 
circumstances  to  which  sve  might  be  directed  is  without  limit. 
Date  of  birth,  books  read,  foo  l eaten,  character  of  ancestry,  char- 
acter of  friends,  nature  of  employment,  are  some  examples  of  cir- 
cumstances  to  which  sve  might  pay  attention.  If,  therefore,  the 
common  circumstance  can  be  found  only  by  examining  all  the  in- 
. stances  for  every  circumstance  which  some  one  or  other  of  the  bald 
‘ group  may  possess,  sve  can  never  find  nil  the  common  factors  In  this 
way.  We  can  carry  on  the  search  for  a ccmmon  factor  only  if  sve 
disregard  most  of  the  ciraim.tr.rv.es  t hick  sve  may  find  as  not  rele- 
vant to  the  phenomenon  of  laldnCfS.  Wc  must,  in  other  words, 
start  the  investigation  with  some  hypothesis  about  the  possible 
cause  of  baldness.  The  hypothecs  which  selects  some  circumstances 
‘ at  possibly  relevant  and  others  as  not  is  constructed  on  the  basis 
of  previous  knowledge  of  similar  subject  matter.  This  hypothesis  is 
cot  supplied  by  the  canon.  Without  some  hypothesis  on  the  nature 
of  relevant  factors  the  canon  is  helpless  to  guide  us  to  our  goal. 

‘ We  have  been  pretending  that  the  circumstances  or  factors  which 
are  present  in  an  Instance  are  distinct  irom  one  another,  and  that 
• • each  comes  labeled,  as  it  were,  with  a tag  saying,  *‘f  am  a circum- 

stance.*’ Rut  it  simply  is  not  tri  e that  an  instance  of  a phenomenon 
b presented  to  us  as  a unique  s:t  of  sharply  defined  factors,  imme- 
diately rccogniiable  by  us  as  such,  so  that  each  factor  can  be  ex- 
. ahrined  and  varied  independently  of  any  other.  Now  the  method  of 
agreement  requires  a comparison  of  circumstances  in  two  or  more 
Instances.  Unless,  therefore,  an  analysis  of  an  instance  into  its  fac- 
tors could  be  made  prior  to  the  u<e  of  the  method,  the  method 
would  be  altogether  useless. 

How  do  we  divide  an  instance  into  its  factors,  and  is  every  analy- 
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sis  of  instances  into  factors  equally  valid?  Consider  the  following 
experiment.  In  two  or  more  test  tubes  of  unequal  sizes  and  each 
filled  with  a liquid  of  a different  color,  a precipitate  is  formed. 
We  wish  to  determine  the  cause  of  the  precipitation  and  are  find 
that  each  instance  can  be  analyzed  into  the  following  circumstances: 

(1)  A test  tube  of  a certain  size  (2)  contains  a liquid  with  a specific 
color  (3)  to  which  sulphuric  acid  has  been  added;  (4)  precipitate 
is  formed.  Factors  1 and  2 are  eliminated  by  the  canon,  since  the 
instances  vary  with  respect  to  these;  the  canon  fixes  upon  the  addi- 
tion of  the  acid  as  the  cause  of  the  phenomenon  (4).  But  this  is  not 
the  only  way  in  which  tire  instances  could  have  been  analyzed.  Wc 
may  have  tried  the  follosving  division  into  circumstances:  (1')  A 
test  tube  of  a certain  size  (2')  contains  a liquid  (3');  sulphuric  acid 
fa. added  to  different-colored  liquids;  (4')  precipitate  is  formed. 
On  the  basis  of  this  analysis,  the  method  would  conclude  that  the 
factor  (2')  fa  the  cause  of  the  precipitation.  This  conclusion  is  in  fact 
false.  It  is  false  because  die  second  method  of  analyzing  the  instances 
fa  not  a "proper**  one. 

Let  us  consider  another  illustration.  We  wish  to  know  the  cause 
of. headaches- We  find  that  it  was  preceded  in  some  cases  by  eye- 
strain,  in  some  iascs  by  indigestion,  and  in  so  cases  by  tire  hard- 
ening or  other  disturbances  of  cernin  blood  vessels.  If  we  take  this 
canon  literally,  none  of  these  ate  common  circumstances  and  there- 
fore to  be  considered  as  causes.  This,  however,  would  be  an  error, 
due  to  an  inadequate  analysis  of  what  is  a headache  and  of  what 
factors  in  atl  these  mentioned  circumstances  are  relevant  to  the 
different  kinds  of  headaches.  A greater  refinement  of  the  causes 
must  be  accompanied  by  an  equal  refinement  of  the  effects.  If  we 
ask  what  fa  the  cause  of  disease,  we  have  grouped  a large  number 
of  phenomena  under  one  rubric  and  the  diverse  causes  of  the  dif- 
ferent kinds  of  disease  must  be  similarly  grouped.  This  point  will 
be  discussed  more  fully  under  the  heading  of  The  Plurality  of 
Causes. 

It  follows  that  not  every  analysis  into  factors  is  equally  valid.  It 
fa  not  valid  because  in  the  light  of  our  knowledge  we  must  not  sep- 
arate. in  such  experiments  as  the  one  above,  the  volume  and  place 
occupied  by  a liquid  from  what  has  been  added  to  it  Now  the 
method  of  agreement  cannot  inform  us  which  fa  the  proper  analy- 
sis. It  cannot  discover  for  us  how  to  divide  instances  into  factors 
such  that  invariable  relations  can  be  found  to  hold  between  some 
of  them.  The  method  cannot  possibly  function  unless,  once  more, 
assumptions  about  relevant  factors  are  made. 
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The  Method  of  Agreement  cs  ii'Ccrton  of  Proof 

Let  us  consider  next  whether  the  method  of  agreement  is  one  of 
. proof,  even  if  we  must  rccogn'r!  thrt  ft  is  not  one  oF  discovery. 
Does  it  follow  that  because  a serucli  for  the  cause  of  a phenom* 
cnon  conforms  to  the  conditions  stated  by  the  method,  the  conclu- 
sion of  the  search  is  thereby  dcnoratrrtcd  to  be  true?  That  it  does 
not  so  follow  can  be  easily  seen.  The  cause  of  a phenomenon  must 
be  invariably  related  with  the  phenomenon.  Hut  we  cannot  ex- 
amine more  than  a limited  number  of  instances  of  any*  alleged  in- 
variable relation.  Even  if  we  could  be  absolutely  certain  that  the 
circumstance  claimed  as  a cans',  is  the  single  common  circumstance, 

can  we  be  certain  that  it  is  irvzti  dh  (for  an  unlimited  number  of 

* instances)  connected  with  the  phenomenon?  Thus  we  may  find  in 
a very  large  number  of  instances  of  typhoid  that  the  activity  of 
microorganisms  is  the  or:c  common  factor  that  is  present.  It  does 
not  follow  tint  this  factor  is  atunyx  present  in  the  stilt  unobserved 
instances  of  typhoid.  Xot  every  actual  conjunction  of  circumstances- 
is  indefinitely  repeated. 

. The  reader  may  perhaps  bclhw:  that  the  inference  from  the  ob- 
served conjunction  of  factors  to  an  invariable  conjunction  is  legiti- 
mate in  virtue  of  the  "uniformity  of  ratine."  We  shall  not  disturb 
the  reader's  faith  in  this  familiar  doctrine  at  this  point*  But.  as  we 
shall  see  presently,  such  a faith  Ins  no  evidential  value  in  demon- 
strating the  existence  of  invariable  connections.  ; 

Not  only(  will  the  method  not.  $cr\c  to  prove  the  presence  of  a 
causal  relation;  it  may,  on  the  co.-mary,  lead  us  to  affirm  some  factor 
to  be  the  cause  when  it  is  not.  We  have  seen  this  alreadyrin  con- 
nection with  the  problem  of  amlyring  instances  into  circumstances. 
We  can  see  this  in  another  way.  Suppose  a professor  of  hygiene 
f/*d$  that  he  had  a splitting  headache  on  three  successive  rights. 
Me  recollects  that  on  Monday  he  ren d for  ten  hours  and  then  took 
ft  walk;  on  Tuesday,  he  found  the  dinner  delicacies  irresistible,  ate 
too  much,  and. then  sought  repentance  by  taking  a walk;  on 
Wednesday,  he  slept  during  the  Jiy  end  then  sought  refreshment 
to  ft  walk.  If  he  were  to  employ  the  method  of  agreement,  he 
might  conclude  that  walking  i as  die  cause  of  his  headache.  But 
this  is  quite  contrary  to  fact,  since  the  walks  he  took  (we  happen 
to  know  on  other  grounds)  ha  o nothing  to  do  with  bringing  on 
the  headaches.  A false  conclusion  was  drawn  by  using  the  method, 
tmee  the  instances  to  which  it  vas  applied  were  not  analyzed  prop- 
erly Into  the  right  circumstance. 
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This  illustration  suggests  another  familiar  doctrine.  The  method 
of  agreement  does  not  provide  a "water-tight"  proof,  it  is  claimed, 
because  there  is  such  a thing  as  a plurality  of  causes.  The  same  phe- 
nomenon is  not  always  produced  by  the  same  cause,  Mill  believed: 
‘There  arc  often  several  independent  modes  in  which  the  same 
phenomenon  could  have  originated."  * A house  may  be  destroyed 
bjr  fire,  or  by  an  earthquake,  or  by  cannon  fire.  Consequently,  this 
method  cannot  find  the  cause.  It  was  such  a reflection  which  com- 
pelted  Mill  to  recognize  an  imperfection  in  the  canon  of  agree- 
ment and  which  led  him  to  supplement  it  with  the  canon  of 
difference.  We  shall  return  to  the  doctrine  of  plurality  of  causes 
presently.  Whether  the  doctrine  is  tenable  or  not,  its  formulation 
and  adoption  by  Mill  show  the  need  for  some  criterion  of  the  cor« 
reel  analysis  of  instances  into  circumstances.  It  is  a criterion  not 
supplied  by  the  canon. 

Employing  the  canon  does  not  guarantee  that  clt  the  necessary 
conditions  for  the  occurrence  of  the  phenomenon  will  be  found.  ' 
Why  is  the  mercury  column  in  barometers  generally  around  thirty 
inches  high?  If  wc  employ  this  method  sve  may  conclude  that  since 
there  is  a vacuum  at  the  top  of  each  column,  the  existence  of  the 
vacuum  Is  die  cause  of  the  observed  height  of  the  mercury.  This 
is  a mistake,  since  wc  know  that  the  occuirencc  of  a vacuum,  is  not 
a sufficient  condition  for  the  height  of  the  column.  The  atmos- 
pheric pressure,  the  temperature  of  the  loom,  are  other  condition! 
which  are  indispensable  in  order  to  explain  the  height  of  the  mer- 
cury. The  method  of  agreement  may,  therefore,  overlook  certain 
generat  conditions  which  must  obtain.  It  may  fix  our  attention  only 
upon  certain  obvious,  even  if  necessary,  features  of  the  instances. 

Tht  /‘iofue  of  the  Method  of  Agreement 

The  method  of  agreement  is  therefore  useless  as  a method  of  dis- 
covery and  fallacious  as  a canon  of  proof.  Has  it  then  no  value? 

It  has  a limited  value,  if  stated  negatively;  Nothing  cart  be  the 
cause  of  a phenomenon  which  is  not  a common  circumsfcrtre  In 
«M  the  instances  of  the  phenomenon.  Thus  stated,  it  is  clearly  a 
method  of  eliminating  proposed  causes  which  do  not  meet  the 
essential  requirements  of  a cause.  A circumstance  that  is  not  com- 
mon to  all  instances  of  a phenomenon  cannot,  by  definition,  be 
causally  related  to  it. 

• A search  for  causes  begins  with  some  assumptions  about  the  po> 
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sfbly  relevant  factors.  Thus  in  studying  baldness  we  may  begin  as 
follows:  Baldness  is  due  to  congenital,  hereditary  factors,  or  to  the 
characters  of  the  diet,  or  to  the  nature  of  the  headwear,  or  to  some 
previous  disease.  The  method  of  agreement  helps  to  eliminate 
some  or  all  of  the  suggested  alternatives-  We  may  discover  that 
the  character  of  the  food  eaten  by  bald  men  is  not  a common  fea- 
ture; and  according  to  the  principle  of  tollendo  portens  we  can 
conclude  therefore  that  only  the  three  other  alternatives  remain 
to  be  examined,  that  is,  baldness  is  either  congenital,  or  it  is  due 
. to  the  nature  of  the  hats  worn,  or  it  is  due  to  some  previous  disease. 
V/e  may  proceed  in  this  fashion  until  we  have  eliminated  all  the 
suggested  alternatives,  or  found  one  or  more  which  cannot  be 
eliminated-  - 

. Unless,  however,  we  have  been  fortunate  enough  to  include  die 
circumstance  which  is  in  fact  the  cause  of  the  phenomenon  in  the 
enumeration  of  alternatives,  the  method  Of  agreement  can  never 
-identify  the  caus  lu  function  is  to  help  eliminate  irrelevant  cir- 
cumstances. 

§ 4.  THE  METHOD  OF  DIFFERENCE 

The  Method  of  Difference  cs  a Canon  of  Discovery  . - 

The  method  of  agreement  was  recognized  as  faulty  by  Mill  be- 
cause we  cannot  be  certain  that  the  phenomenon  investigated  has 
only  one  cause.  It  was  believed  to  be  useful  in  those  cases  where  we 
could  not  alter  the  circumstances  at  will.  Hence,  it  was  regarded 

Enmarity  a' method  of  observation  rather  than  experiment.  It  was 
etieved,  however,  that  the  shortcomings  of  this  method  can  be  over- 
come by  the  use  of  a second  canon,  the  method  of  difference. 

■ This  second  method  requires  two  instances  which  resemble  each 
other  In  every  other  respect,  but  difUr  in  the  presence  or  absence 
of  the  phenomenon  investigated.  Its  full  statement  la  "If  on  m* 
Stance  in  which  t/te  phenomenon  under  investigation  occurs , and 
an  instance  in  which  it  does  not  occur,  have  every  circumstance  in 
common  save  one,  that  one  occurring  in  the  former;  the  circunu 
stances  in  which  alone  the  two  instances  differ,  is  the  effect,  or  the 
cause,  or  an  indispensable  part  of  the  cause,  of  the  phenomenon." * 
Let  us  sec  whether  this  canon  is  elective  in  discovering  causes. 
Suppose  the  reader  buys  two  fountain  pens  of  like  make,  fills  them 
with  the  same  kind  of  ink,  places  them  in  his  pocket,  and  takes  a 
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long  wait  before  he  sits  down  to  write.  When  he  does,  he  discovers 
that  one  of  the  pens  leaks.  What  is  the  cause  of  this?  It  seems  as  if 
the  conditions  for  applying  the  canon  ate  a!!  present.  The  pens  are 
alike,  but  one  leaks  and  the  other  dees  not.  And  if  the  reader  em- 
ploys the  canon  he  may  "discover’’  that  the  rubbet  sack  in  one  of 
the  pens  has  lost  its  elasticity  and  is  extremely  porous;  the  other 
pen  has  not  this  defect.  The  condition  of  the  rubber,  the  reader 
may  conclude,  is  the  cause  of  the  leak. 

But  is  the  matter  as  simple  as  that?  If  we  take  the  canon  seriously, 
we  must  conclude  that  the  method  cannot  be  applied  in  this  in- 
quiry, since  it  requires  that  the  two  pens  be  exactly  alike  in  all  cir- 
cumstances except  those  mentioned.  But  die  two  pens  do  differ  in 
very  many  ways:  one  svas  made  before  the  other,  or  by  a different 
workman;  the  shapes  of  the  pens  have  minute  differences;  chemical 
analysis  reveals  other  differences;  the  pens  were  not  placed  in 
exactly  the  same  position  in  the  reader’s  pocket,  nor  were  they 
warmed  equally  by  his  body. 

If  anyone  should  object  that  the  pens  need  not  be  exactly  alike, 
but  only  atike  in  relevant  factors,  wc  r.ius:  reply  that  it  is  precisely 
such  judgments  of  relevance  which  arc  required  before  the  cation 
may  be  used;  and  that  the  canon  doe;  not  supply  this  vital  infor- 
mation- If,  however,  the  objector  shor’d  declare  that  the  two  pens 
can  be  shown  to  be  alike  by  examining  all  the  circumstances,  we 
would  be  forced  to  reply  that  an  exhaustive  examination  of  the 
circumstances  is  impossible;  and  that  if  it  were  possible,  the  canon 
would  be  unnecessary  to  discover  the  factor  which  is  present  when 
the  phenomenon  is  present  and  ab>er:  when  it  is  absent, 

This  canon,  like  the  previous  one,  requires,  therefore,  the  ante- 
cedent formulation  of  a hypothesis  concerning  the  possible  relevant 
{actors.  The  canon  cannot  tetl  us  what  factors  should  be  selected  for 
study  from  the  innumerable  circumstances  present.  And  the  canon 
requires  that  the  circumstances  shall  have  been  properly  anatyred 
ana  separated.  We  must  conclude  that  it  is  not  a method  of  dis- 
covery. 

The  Method  of  Difference  at  a Canon  of  t'roof 

Is  it  a method  of  proof  or  demonstration?  No  more  than  the 
method  of  agreement!  Whatever  value  the  canon  may  seem  to  have 
depends  on  the  assumption  that  differences  are  noted  with  respect 
to  the  presence  or  absence  of  a tingle  factor.  But  can  the  canon 
assure  us  that  the  factor  is  not  complex) 

Suppose  a man  is  psychically  and  socially  maladjusted.  He 
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fcis  from  unconifoJUiLIc  thca;iw.  lie  zoos  to  n psychoanalyst,  who 
persuades  him  to  disclose  indu  :.*e  details  of  his  autobiography, 
Ariel  in  particular  of  Ins  sex  Me.  Thu  man  "recovers'*  from  the 
maladjustment  and  the  (luvum  ce:-'e.  '1  he  canon  seems  to  be  ap- 
plicable: the  "single  diheu  we*'  In  the  events  of  the  man’s  life 
during  this  period  is  the  cxpied-ii  of  his  hidden  sex  desires.  Can 
we  validly  conclude  that  du  jntnds  t dl.ing  fieely  on  sex  is  the  cause 
of  his  recovery?  Certainly  not.  J he  change  in  the  man’s  life  may 
in  fact  be  elite  simply  to  his  finding  in  the  analyst  a sympathetic 
audience  on  any  subject  wiuievti,  or  it  may  be  due  to  the  cessation 
of  ccilam  organic  distntb.Miii.s  ua! m>;.u  to  the  patient  or  the 
analyst. 

Can  this  method  demon  time  mi  i.^JUiblc  connection  by  an 
examination  of  tivo  instance-?  S-ippw-e  the  vea  la  spends  a slecjr- 
less  night,  but  on  the  foi!cjwin.e  n\J  i lie  rests  peacefully*  The 
reader  may  be  able  to  comince  himself  dial  the  single  "significant” 
difference  in  his  behavior  cvi  1 7 1 c tv/u  days  preceding  lhc.se  nights 
is  that  he  had  drunk  coHcc  on  t!u-  fu*:  day  but  not  on  the  second. 
Can  the  reader  validly  conclude  ilm:  chinking  coffee  is  (fot  hint) 
the  invariable  cruse  of  sleepless! m**,  other  things  being  equal?  li 
may  be  true  that  his  sleeplcsmt^  on  tin*  fn>i  night  was  in  fact  due 
to  the  coffee.  NcvctlhcSos,  it  may  1 e lin  t k is  not  drinking  coffee 
os  such  which  pioduccs  the  iu.h  .h  *h!  ♦ icsuh.  The  insomnia  may 
Lc  clue  to  a drug  that  the  coffee  ro;-t;:i:u%d.  The  reader's  steeple ^ j- 
ness  on  that  particular  night  was  h}  lupothcu*,  due  to  chinking 
tliat  particular  coffee.  Ihu  it  cl*^ j-  n.-i  follow  that  in  general  coffee- 
drinking  is  followed  by  a rc*t!i>>  iTT*-  'J  lie  application  of  die 
canon  does  not,  therefote,  wnlhv.  nh  leu!  m the  detection  of  the 
factors  in  tenus  of  which  an  i uvati  bk  relation  can  be  expressed, 
and  it  may  lead  to  nn  aftr;nat:o:.  i f an  invariant  lelaiion  where 
none  in  fact  exists.  The  canon  i k . not  safeguard  us  against  the 
fallacy  known  as  post  ho;,  ago  piupU-r  hoc:  sleeplessness  may 
(ollo\e  drinking  toffee,  but  People  ness  may  not  occur  because 
coffee  was  diunk. 

The  statement  of  the  canon  eh;1  tty  rccoguires  that  the  factor 
noted  by  it  may  be  only  a /.mf  of  (he  ce.use.  ‘1  his  is  a vciy  impor- 
tant qualification.  T he  in  van  able  ul  Tom  hr  which  the  sciences 
seek  arc  such  that  if  a dctuinin.uc  m ‘ of  uiauouanccs  .arc  present, 
some  other  circumstance  a . ill  akwsvs  accompany  these.  T lie  dis* 
covciy  of  a partial  set  of  ciumn*!  ?w<>  U often  net  enough.  Now 
the  method  of  dHTcicncc  eanuel  g.\n:u  tee  thr.i  the  sufficient  con- 
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dilions  for  a phenomenon  have  been  found.  We  cannot  infer  that 
rain  is  a sufficient  condition  for  the  rich  harvest  in  one  part  of  the 
state  on  the  ground  that  there  was  a drought  in  another  part,  even 
though  the  quality  of  the  seeds  planted  and  of  the  soil  and  the 
quantity  of  sunshine  were  in  all  significant  ways  the  same.  For  it 
isn't  the  rain  clone,  but  the  rain  together  with  the  soil,  seed*,  and 
sunshine,  which  provides  the  adequate  conditions  for  a bumper 
crop.  The  canon  may  therefore  very  easily  direct  our  attention  to 
extremely  partial  and  even  superficial  f.  ctors  in  the  complete  situa- 
cion.  On  the  basis  of  the  canon  we  might  argue  dial  since  condi- 
tions in  Europe  in  January,  1914,  were  the  same  as  in  July,  and 
since  the  only  relevant  difference  was  the  murder  of  the  Ardhduke 
Ferdinand,  the  assassination  was  the  cause  of  the  World  War.  With- 
out denying  the  importance  of  this  event  in  explaining  just  when 
the  war  did  take  place,  no  serious  student  of  affairs  would  hesitate 
to  point  out  the  complicated  national,  diplomatic,  and  socio- 
economic factors  which  were  part  of  the  conditions  required  to 
explain  the  occurrence  of  the  war. 

The  Value  of  the  Method  of  Difference  • ....  ; 

The  method  of  difference  cannot,  therefore,  be  regarded  other  as 
' a.  method  of  discovery  or  as  a method  of  proof.  But,  like  the  method 
of  agreement,  it  has  a limited  value  when  stated  negatively:  No  th- 
ine can  he  the  came  of  a phenomenon  if  the  phenomenon  does  not 
take  place  when  the  supposed  cause  does.  Thus  stated,  it  is  clearly 
’ a method  of  eliminating  one  or  more  proposed  causes  which  do  not 
meet  the  essential  requirement  of  a cause.  A circumstance  that  is 
present  whether  die  phenomenon  is  present  or  not  cannot,  by  defi- 
nition, be  causally  related  to  it.  Thus  if  we  arc  studying  rheuma- 
tism, wc  may  entertain  tire  hypothesis  that  it  is  caused  by  excessive 
starch  in  the  diet,  or  by  lack  of  exercise,  or  by  a foal  infection  in 
the  teeth.  Provided  that  these  alternatives  represent  an  adequate 
analysis  and  separation  of  the  circumstances,  we  may  be  able  to 
eliminate  the  diet  dicory  of  rheumatism  if  we  can  show  that  large 

Suantitic*  of  starch  can  be  consumed  without  being  fcl'.owed  by 
re  effect.  Proceeding  as  we  did  with  the  canon  of  agreement,  we 
may,  in  accordance  with  the  principle  of  tollendo  ponens,  be  able 
to  eliminate  all  but  one  alternative.  Again,  the  methoa  of  difference 
it  helpless  if  we  have  not  die  sagacity  to  include  In  the  alternatives 
considered  for  further  study  the  factor  which  is  in  fact  the  cause. 
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S 5.  the  joint  method  of  agreement  and  difference 

The  two  methods  so  far  considered  require  conditions  for  their 
application  which  we  can  never  find  realized.  The  first  method  re* 
quires  instances  which  arc  unlike  in  ever)-  respect  except  one;  the 
second  requires  instances  which  are  alike  in  every  respect  except 
one.  When  the  phenomenon  is  dependent  upon  a comptex  set  of 
conditions,  it  is  difficult  to  separate  the  factors  involved  and  vary 
- them  one  at  a time.  Mill  therefore  proposed  a combination  of  the 
two  preceding  methods.  Its  formulation  is;  '7/  two  or  more  instances 
in  which  the  phenomenon  occurs  hive  only  one  circumstance  in 
common,  while  two  or  more  instances  in  which  it  does  not  occur 
have  nothing  in  common  save  the  absence  of  that  circumstance ; the 
circumstance  in  which  alone  the  two  sets  of  instances  differ,  is  the 
effect , or  the  cause,  or  an  indispensable  part  of  the  cause*  of  the 
phenomenon”  • 

The  statement  of  the  canon,  however,  is  really  absurcL  Accord* 
fng  to  it  we  require  two  sets  of  instances.  In  one  set  the  phenomenon 
occurs,  and  the  instances  taken  together  must  have  a single  common 
circumstance,  although  taken  two  at  a lime  they  may  agree  in 
more  than  one  circumstance.  In  the  second  set  the  phenomenon 
does  not  occur,  and  the  instances  must  be  so  chosen  that  they  have 
nof/iing  in  common,  taken  together,  n\c  the  absence  of  the  phe- 
nomenon. But  if  see  follow  these  i.mructions  wc  may  include  any* 
thing  we  wish  in  the  second  set,  since  the  absence  of  some  one  char- 
acter need  be  the  only  identical  fc  .turc  in  alt  of  them!  Suppose  we 
wish  to  discover  the  conditions  which  mike  for  divorce.  According 
to  the  method,  we  must  in  the  fiist  place  examine  a number  of 
divorced  couples,  and  in  the  second  place,  examine  a number  of 
cases  where  divorce  has  not  occurred,  for  example,  among  flowers, 
children,  mountains,  bachelors,  and  so  on.  We  could  not  possibly 
use  these  negative  instances  to  determine  the  cause  of  divorce.  We 
must  therefore  modify  the  fotmulatior.  of  the  canon.  The  negative 
instances  must  be  all  of  a ty pc  in  which  the  phenomenon  U capable 
of  being  present  when  the  adequate  conditions  are  supplied. 

As  a metliod  of  discovery  and  proof,  this  canon  combines  all  the 
defects  of  the  first  two  canons;  its  \ittucs  arc  the  virtues  of  cither. 
However,  it  does  formulate  certain  aspects  of  methods  employed  in 
making  comparisons  between  largo  gtoup*.  If  we  were  to  try  the 
method  of  difference  alone  for  finding  the  cause  of  divorce,  we 
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would  require  two  couple?,  one  divorced,  the  other  not,  that  are 
alike  in  every  way  except  one.  This  is  hardly  feasible.  If,  however, 
we  were  to  .examine  a large  nurnbrr  of  married  pairs,  we  might  be 
able  to  show:  that  some  of  the  circumstances  which  are  common  to 
all  of  them  are  not  significant  for  their  continuing  in  the  married 
state,  provided  wc  could  also  show  that  divorced  pairs  show  the 
same  common  features.  Wc  might  be  unable  to  identify  the  cause 
of  divorce  by  tins  method.  Nevertheless,  by  examining  several  large 
groups  we  might  be  able  to  show  some  relation  between  the  relative 
frequency  of  divorce  and  such  factors  as  differences  in  the  age,  edu- 
cation, health,  and  so  on  of  the  parties  to  a marriage.  Such  sta» 
- tistical  information  may  be  all  thru  can  be  obtained.  The  knowl- 
edge of  the  relative  frequency  of  divorce  for  individuals  differing 
- considerably  in  age,  for  cxnmp;  \ will  be  useless  for  determining 
whether  divorce  will  terminate  the  marriage  of  some  particular 
married  pair.  It  may  be  very  useful  in  ascertaining  how  often  we 
may  expect  divorces  in  a very'  large  group. 

§ 6.  THE  METHOD  OT  CONCOMITANT  VARIATION 

The  elimination  of  irrelevant  circumstances,  which  wc  have  seen 
to  be  the  func::on  of  the  preceding  canons,  cannot  be  performed 
by  them  in- all  cases.  For  it  is  sometimes  impossible  to  exclude  or 
■ isolate  the  cause  completely.  If  we  wish  to  find  die  cause  of  die  rise 
and  fall  of  the  ddes  of  rivers  and  seas,  we  cannot  use  the  canon  of 
difference,  since  we  cannot  find  an  instance  in  which  such  a body 
of  water  does  not  show  the  phenomenon  of  tidal  behavior.  We  can- 
not show  with  the  method  of  difference  that  the  sun  and  moon  are 
the  cause  of  tides,  since  we  cannot  eliminate  the  action  of  these 
bodies  in  any  instances.  And  we  cannot  use  the  canon  of  agreement, 
because  we  cannot  remove  from  die  instances  of  tidal  behavior  such 
Ineradicable  common  circumstances  as  the  presence  of  the  fixed 
itars. 

In  such  cases,  however,  we  may  notice  or  introduce  variations  in 
the  degree  or  magnitude  of  the  effect,  and  find  a corresponding 
variation  in  some  circumstance,  without  thereby  completely  elimi- 
nating either  the  effect  or  the  supposed  cause.  The  method  of  con- 
comitant variation  has  been  formulated  by  Mill  to  deal  with  such 
phenomena.  Its  statement  is:  "Whatever  phenomenon  varies  in  any 
manne.  whenever  another  phenomenon  varies  in  some  partlcufat 
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manner,  is  either  a cause  or  cn  effect  of  that  phenomenon , or  is 
connected  unth  it  through  some  fact  of  causation  "• 

This  canon  can  be  employed,  therefore,  only  if  degrees  or  mag- 
monies  of  effects  and  causes  can  be  distinguished.  The  previous 
anons  are  qualitative  methods,  since  their  use  requires  simply  the 
determination  of  the  presence  or  absence  of  some  character  or 
quality,  lhc  present  canon  is  quantitative,  and  requires  the  aid  of 
measurement  and  statistical  technique. 

The  Canon  of  Concomitant  Variation  as  a Method  of  Discovery 
An  examination  of  the  statement  of  the  canon  of  concomitant 
variation  must  make  us  suspicious  of  its  efficacy  as  a method  of 
iscovery.  It  cclaics  that  if  a phenomenon  varies  in  any  manner 
whenever,  another  phenomenon  varies  in  soma  manner,  a causal 
relation  is  present.  Now  if  the  concomitant  variation  Is  actually 
invariable,  and  tins  seems  to  be  required  by  the  word  "whenever,” 
a causal  relation  is  indeed  present.  But  if  in  order  to  employ  the 
canon  we  must  know  antecedently  that  the  mode  of  variation  is 
invariable,  of  what  use  is  the  canon?  Wc  do. not,  in  that  case,  need 
the  canon  to  discover  for  us  the  muse.  And  the  canon  by  its  own 
admission  is  perfectly  helpless  in  finding  the  rule  of  variation  or 
in  demonstrating  that  a supposed  mode  of  variation  is  invariable. 

This  suspicion  is  strengthened  if  we  try  to  use  the  canon.  Sup-  " 
pose  we  notice  that  the  tempetature  in  a region  varies  in  some 
determinate  manner  during  several  months.  mat  is  the  cause  of 
tins  variation?  We  look  for  some  circumstance  present  during  these 
months  which  also  undergoes  some  variation.  Bur  which  circum- 
stances shall  we  examine?  Certainly  not  all  che  circumstances,  not 
even  all  the  varying  circumstances.  Tire  formulation  of  hypotheses 

. and  judgments  of  relevance  are  required  before  this  canon  can  be 
employed. 

.v.-\1C  ®0R,PHea,«J  causal  dependencies  between  several  variables 
whichthe  natural  sciences  study  cannot  possibly  be  unraveled  un- 
less hypotheses  based  on  knowledge  of  mathematical  relationships 
, a*c  . . atc<1  concerning  them.  Even  such  a relatively  simple  rule 
of  variation  as  the  iuvcrse-squaic  lav.-  of  gravitational  attraction 
cannot  be  obtained  by  merely  observing  the  behavior  of  planets. 

The  Method  of  Concomitant  Variation  cs  a Canon  of  Proof 
The  meie  presence  of  a concomitant  variation  of  temperature 
and  some  other  factor  is  not  sufficient  to  establish  a causal  conncc- 
*A  System  of  Logie , Vol.  I,  p.  454. 
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non.  Suppose  the  changes  in  the  daily  temperature  m New  York 
City  during  one  year  could  be  shown  to  vary  with  the  daily  death 
rate  in  China  in  that  period.  Such  correlations  would  be  regarded 
by  most  competent  judges  as  fortuitous,  because  they  have  some 
prior  knowledge  concerning  the  relevant  factors  in  the  production 
of  temperature  changes.  Even  very  high  correlations,  especially  in 
the  social  sciences,  do  not  necessarily  sign’fy  an  invariable  connec- 
tion. For  the  phenomena  between  which  such  correlations  can  be 
established  may  be  in  fact  unrelated  in  any  way  which  would  war- 
rant our  believing  them  to  be  invariably  connected.  A little  statis- 
„ * tical  skill  and  patience  make  it  possible  to  find  any  number  of 
high  .correlations  between  otherwise  unrelated  factors.  Wc  do  not 
discover  causal  connections  by  first  surveying  all  possible  conela* 
V. . “ tions  between  different  variables.  On  the  contrary,  wc  suspect  an 
invariable  connection,  and  then  use  correlations  as  corroborative 
evidence. 

- Moreover,  the  correlations  obtained  on  the  basis  of  an  examina- 
tion of  a finite  number  of  pairs  of  variables  are  unreliable,  because 
we  cannot  be  sure  that  the  rule  of  variation  remains  the  same  out- 
side of  the  actually  observed  limits  of  variation.  We  may,  by  good 
fortune,  come  to  the  study  of  gases  by  selecting  a gas  like  helium 
at  a high  temperature.  We  may  then  observe  that  if  die  tempera- 
ture remains  constant,  the  pressure  varies  inversely  as  the  volume 
of  the  gas.  We  may  observe  tills  rule  of.  variation  for  certain  inter- 
vals of  temperature,  and  then  extrapolate  the  rule  for  every  value 
of  the  temperature,  or  even  for  any  gas  at  any  constant  tempera- 
ture. But  if  we  do  so,  we  are  sure  to  blunder,  since  it  is  now  known 
. that  Boyle's  law  is  true  only  for  a few  gases  under  idea!  conditions. 
Indeed,  a rule  of  variation  which  has  been  found  to  hold  within 
1 certain  intervals  may  become  altogether  inaccurate  outside  those 
' limits,  not  only  because  the  rule  of  variation  is  different,  but  also 
because  circumstances  negligible  within  those  limits  cannot  be 
" neglected  outside  that  interval.  The  period  of  a pendulum  is  pro- 
portional to  the  square  root  of  its  length  if  the  arc  of  the  swing 
.. . . is  small.  When  the  arc  of  the  swing  is  increased,  the  period  (theo- 
retically and  approximately)  is  stilt  related  in  this  way  to  the 
length;  nevertheless,  the  factor  of  air  resistance  must  now  be  con- 
........  sidered,  so  that  die  period  can  no  longer  be  rendered  by  drat  sira* 

pie  formula. 

The  Value  of  the  Method  of  Concomitant  Variation' 

The  method  of  concomitant  variation  cannot  therefore  be  ac- 
cepted as  a method  of  either  discovery  or  proof.  Its  value  lies  pardy 
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in  suggesting  lines  of  inquiry  for  causal  relations  and  ia  helping  to  ! 

corroborate  hypotheses  of  causal  connection.  Its  chief  value,  how-  j 

$ver,  is  to  help  eliminate  irrelevant  circumstances.  For  nothing  will  ! 

be  regarded  as  the  cause  of  a phenomenon  if  when- the  phenomenon  ! 

varies  that  thing  docs  not,  or  when  the  phenomenon  does  not,  that  ! 

thing  does.  Consequently,  the  method  will  help  eliminate  those 
factors  suggested  by  the  hypothesis  guiding  the  inquiry  which  do 
not  conform  to  this  condition.  Mill's  statement  of  the  canon  asserts 
v that  .if  C_jvaric$  whenever  E varies,  C and  E are  causally  related, 

, We  have  seen  that  this  claims  too  much.  All  that  can  be  affirmed  ! 

- is  that  C and  £ ate  not  causally  related  if  C and  E do  not  vary 
- * concomitantly.  And  even  in  this  modified  form  the  method  will  not 

save  us  from  error  if  the  circumstances  denoted  by  C and  E are  not 
..  -.  properly  analyzed,  ... 

~ V § 7.  THE  METHOD  OF  RESIDUES  - J 

The  remaining  method  of  “discovery  and  proof,**  the  method  of  * 

.1*  residues,  expresses  moTe  dearly  than  the  others  the  eliminative  1 

function  of  all  the  canons.  Its  statement  is:  "Subduct  from  any 
phenomenon  such  part  as  is  known  by  previous  inductions  to  be 

. the  effect  of  certain  antecedents , and  the  residue  of  the  phenomenon 

„ is  the.  effect  of  the  remaining  antecedents”1*  - 
• The  method  very  clearly  depends  upon  our  making  use  of  some 

' already  known  causal  connections  in  order  to  isolate  die  influence 
• of  some  other  known  or  assumed  cause  by  means  of  a strictly  de-_ 
ductive  argument. 

A favorite  illustration  for  this  method  is  tire  discovery  of  the 
planet  Neptune  by  Adams  and  Le  Vender.  The  morions  of  the 
planet  Uranus  had  been  studied  by  the  help  of  Newiou#$  theories. 

Its  orbit  was  plotted  on  the  assumption  that  the  sun  and  the  planets 
. within  the  orbit  of  Uranus  were  the  only  bodies  which  determined 
its  motion.  But  the  calculated  positions  of  Uranus  were  not  in 
agreement  with  the  observed  positions.  On  the  assumption  that 
these  differences  could  be  explained  by  the  gravitational  action  of 
a planet  outside  the  orbit  of  Uranus,  the  position  of  such  a hypo- 
thetical planet  (behaving  according  to  the  usual  principles  of  celes- 
tial mechanics)  was  calculated  from  the  perturbations  in  the  mo- 
tion of  Uranus.  And  in  fact  the  planet  Neptune  was  discovered  in 
*the  vicinity  of  the  place  calculated  for  it.  This  achievement  is 
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therefore  credited  to  the  method  of  residues. 

But  the  argument  used  for  locating  Neptune  Is  easily  seen  to  be 
strictly  deductive.  We  must  accc  \ in  the  first  place,  the  univer- 
sality of  Newton’s  theory  of  gravitation.  We  must  assume,  in  the 
second  place,  that  the  motion  of  Urartus  is  determined  by  the 
known  bodies  v ithin  its  ovbit  and  a single  unknown  body  outside 
its  orbit.  The  position  of  this  unknown  body  can  then  be  calcu- 
lated if  we  also  know  how  much  of  the  observed  behavior  of 
Uranus  is  due  to  die  influence  of  the  interior  planets.  Now  the 
canon  of  residues  itself  did  not  pick  out  the  cause  of  the  discrep- 
ancies in  the  observed  behavior  of  Uranus.  A hypothesis  concerning 
the  possible  source  of  this  discrepancy  had  to  be  explicitly  intro- 
duced. The  canon  simply  expresses  the  fact  that,  on  the  assump- 
tions made*  the  interior  masses  were  to  be  eliminated  as  the  causes 
of  the  observed  discrepancies.  It  does  not  suggest  where  the  source 
of  the  residual  phenomena  is  to  be  located.  It  does  not  demon- 
strate that  the  suspected  source  of  such  residual  phenomena  is 
causally  related  to  them. 

In  this  illustration  one  further  condition  for  the  applicability  of 
this  method  must  be  noted.  We  can  calculate  the  position  of  the 
planet  Neptune  only  if  we  know  the  law  according  to  which  forces 
of  attraction  can  be  compounded.  These  forces  arc  supposed  to  act 
“independent! y#*  of  one  another.  This  means  that  if  one  of  the 
interior  planets  should  fly  off  from  the  sol3r  system,  the  magnitude 
of  acceleration  that  each  of  the  remaining  bodies  would  contribute 
to  the  behavior  of  Uranus  could  still  be  calculated  from  their 
known  positions  and  masses.  Wherever  the  forces  studied  are  not 
independent  in  this  sense  (where,  in  other  words,  the  effect  of  two 
forces  cannot  be  calculated  from  a knowledge  of  each  in  isolation) 
the  method  of  residues  cannot  be  employed. 

§ 8.  SUMMARY  STATEMENT  OF  THE  VALUE  CF  THE 
EXPERIMENTAL  METHODS 

We  will  now  summarize  this  long  discussion  of  the  experimental 
canons.  Every  investigation  oj  the  cause  of  a phenomenon  P must 
Mart  with  a hypothesis.  Suppose  Hv  Hv  . . . Ha  are  a set  of  alter- 
native hypotheses  concerning  the  possible  determining  conditions 
of  P.  Th  ? H's  therefore  express  our  sense  of  what  is  relevant  in  any 
occurrence  of  P.  No  observation,  experiment,  or  reasoning  can  pro- 
ceed without  an  explicit  or  implicit  acceptance  of  *he  following: 
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Proposition  I.  Either  Hk,  or  H2  or  . . . Ha  jj  the  causal  law  of 
P.  The  function  of  the  experimental  canons  Is  to  eliminate  some  or 
all  of  these  alternatives.  We  try  to  show  that  In  the  instances  where 
P is  present,  Hx  docs  not  hold:  or  that  II  x is  true  in  those  cases 
where  P is  absent  as  well  as  where  P is  present;  or  that  a variation 
'occurs  in  P without  a correlated  change  in  some  factor  denoted  by 
Ht.  If  Ke  arc  successful  in  showing  any  one  of  these  things  (and  if 
JFfx  represents  a proper  analysis  of  circumstances)*  Ht  is  then  elimi- 
nated as  a causal  law  of  P.  Thus  experiment  may  establish  the  fol- 
lowing: 

Proposition  Hi  is  not  the  causal  lav  of  P. 

We  rcry  then  conclude  from  propositions  1 and  2: 

Proposition  3.  Hx  or  H,  or,  . . . //„  is  the  causal  law  of  P . 

The  same  procedure  can  now'  be  undertaken  for  Hz,  and  so  on* 
And  we  may  be  successful  in  eliminating  all  the  alternatives  but 
Hn . Provided  that  Hn  cannot  be  eliminated,  wc  may  conclude  that 
H*  is  the  causal  law  of  P,  on  the  assumption  that  the  nff  s are  die 
only  possible  causal  laws. 

But  it  is  clear  that  this  procedure  is  efficacious  in  finding  causal 
laws  only  if  the  following  be  true. 

a.  Proposition  1 must  be  based  upon  a proper  analysis  of  the 
circumstances  attendant  upon  P.  The  H' s must  express  the  relevant 
relations  of  P to  certain  other  factors. 

&.  The  n alternatives  H must  include  die  true  causal  law  of  P.  If 
we  have  not  been  fortunate  enough  to  include  the  true  causal  law', 
all  the  alternatives  may  be  eliminated  and  the  cause  of  P not  be 
ascertained.  But  no  directions  can  be  given  how  to  include  the  true 
law  in  an  enumeration  of  possible  laws.  The  difficult  step  in  ex- 
tending our  knowledge  consists,  therefore,  in  finding  propositions 
of  the  form  If  H then  P where  H is  a suitable  hypothesis  or  dieory 
front  which  the  phenomenon  P can  be  shown  to  be  a consequence* 

i*.  Proposition  3 is  obtained  by  strictly  necessary  reasoning  from 
propositions  1 and  2. 

d.  The  concluding  propositions  are  not  demonstrated  to  be  true 
unless  propositions  1 and  2 are  in  fact  true.  But  we  can  rarely,  if 
ever,  be  certain  that  proposition  1 is  an  exhaustive  statement  of 
all  possible  causal  laws  for  P.  . . 

The  canons  of  experimental  inquiry  are  not  therefore  capable  of 
demonstrating  any  causal  laws.  ^ ...... 

The  experimental  methods  are  neither  methods  of  proof  nor 
methods  of  discovery.  The  canons  which  formulate  them  state  in  a 
more  explicit  manner  what  it  is  we  generally  understand  by  a causal 
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or  invariant  relation.  They  define  what  we  mean  by  the  relation  o! 
cause  and  ellect,  but  do  not  f.nd  cases  of  such  a relau'on.  The  hope 
of  discovering  a method  that  will  "leave  little  to  the  sharpness  and 
strength  of  men’s  wits"  is  one  which  finds  no  support  from  a care- 
ful study  of  the  procedure  of  the  sciences. 

But  while  the  methods  we  have  discussed  have  the  defects  pointed 
out,  they  are  of  undoubted  value  in  the  process  of  attaining  truth. 
For  in  eliminating  false  hypotheses,  they  narrow  the  field  within 
which  true  ones  may  be  found.  And  even  where  these  methods  may 
fail  to  eliminate  all  irrelevant  circumstances,  they  enable  us  with 
some  degree  of  approximation  to  so  establish  the  conditions  for 
the  occurrence  of  a phenomenon,  that  see  can  say  one  hypothesis  is 
logically  preferable  to  its  rivals. 

§ g.  THE  DOCTRINE  OF  THE  UNIFORMITY  OF  NATURE 

The  claim  that  the  experimental  methods  are  capable  of  demon- 
strating with  complete  certainty  universal,  invariable  connections 
rests  on  a belief  that  "nature  is  uniform."  Induction,  according  to 
Mill,  consists  in  inferring  from  a finite  number  of  observed  in- 
stances of  a phenomenon,  that  it  occurs  in  all  instances  of  a cer- 
tain class  which  resemble  the  observed  instance  in  certain  ways. 
But  according  to  Mill,  the  very  statement  of  what  induction  is  re- 
quires an  assumption  concerning  the  order  of  the  universe.  The 
asuroption  is  that  "there  are  such  things  in  nature  as  parallel 
cases,  that  what  happens  once,  will,  under  a sufficient  degree  of 
similarity  of  circumstances,  happen  again."11 

This  assumption  may  be  expressed  in  various  ways:  that  nature 
is  uniform,  that  the  universe  is  governed  oy  general  laws,  that  the 
same  cause  will  under  similar  circumstances  be  accompanied  by  the 
same  effect.  In  some  form,  however,  so  the  claim  runs,  it  is  required 
for  induction.  Every  induction  may  be  thrown  into  the  form  of  a 
syllogism,  and  the  principle  of  the  uniformity  of  nature  will  then 
appear  as  the  "ultimate  major  premise  of  all  inductions.” u 

Mill  puts  the  matter  as  follows:  "The  induction,  ‘John,  Peter, 
etc.,  are  mortal,  therefore  all  mankind  are  mortal/  may  . . . be 
thrown  into  a syllogism  by  prefixing  as  a major  premise  (what  is  at 
any  rate  a necessary  condition  of  the  validity  of  the  argument) 
namely,  that  what  is  true  of  John,  Peter,  etc.,  is  true  of  all  man- 
kind. But  how  came  we  by  this  major  premise?  It  is  not  self-evident; 
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nay,  in  all  cases  of  unwarranted  generalization,  it  is  not  true.  How, 
then,  is  it  arrived  at?  Neressarily  either  by  induction  or  ratio- 
cination; and  if  by  induction,  the  process,  like  all  other  inductive 
arguments,  may  be  thrown  into  die  form  of  a syllogism.  This  pre- 
vious syllogism  it  is,  therefore,  necessary  to  construct.  There  is,  in 
the  long  run,  only  one  possible  construction.  The  real  proof  that 
what  is  true  of  John,  Peter,  etc.,  is  true  of  all  mankind,  can  only 
be  that  a different  supposition  would  be  inconsistent  with  the  uni- 
formity which  we  know  to  exist  in  the  course  of  nature.  Whether 
there  would  be  this  inconsistency  or  not,  may  be  a matter  of  long 
and  delicate  inquiry;  but  unless  there  would,  we  have  no  sufficient 
"ground  for  the  major  of  the  inductive  syllogisms.  It  hence  appears, 
r that  if  we  throw;  the  whole  course  of  any  inductive  argument  into 
& series  of  syllogisms,  wc  shall  arrive  by  more  or  fewer  steps  at  an 
"ultimate  syllogism,  which  will  have  for  its  major  premise  the  prin- 
ciple, or  axiom,  of  the  uniformity  of  the  course  of  nature;”1* 

. -We. shall  not  discuss  whether  the  principle  of  the  uniformity  of 
nature  is  true  or  whether  some  such  principle  Is  required  for  mak- 
ing inductive  inferences.  We  wish  simply  to  determine  whether  the' 
principle  if  it  were  true  would  in  fact  help  to  demonstrate  the  exist- 
ence of  some  particular  instance  of  a supposed  causal  relation.  "We 
must  carefully  note  the  following. 

1.  The  principle  is  stated  in  an  extremely  vague  form— "what 
happens  once,  will,  under  a sufficient  degree  of  similarity  of  cir- 
cumstances, happen  again."  But  what  is  a sufficient  degree  of  simi- 
larity? The  principle  docs  not  tell  us.  In  any  particular  investiga- 
tion we  must  rely  on  other  criteria,  if  there  an.  any,  to  determine 
what  are  the  circumstances  material  to  the  occurrence  of  a phe- 
nomenon. 

2.  In  the  second  place,  the  minor  premise  of  an  inductive  syl- 
logism, even  according  to  Mill,  is  a particular  proposition.  There- 
fore even  if  we  employ  a universal  major  premise,  such  as  the  prin- 
ciple of  uniformity  of  nature,  the  premises  are  insufficient  to  dem- 
onstrate a universal  conclusion. 

. 3.  Finally,  the  principle  does  not  affinn  that  every  pair  of  phe- 
nomena are  invariably. related.  It  simply  states  that  some  pairs  are 
so  connected.  To  appeal  to  the  doctrine  in  a particular  investiga* 
- tion  is  therefore  useless.  If  we  suspect  that  tight-fitting  hats  are  the 
cause  of  baldness,  we  employ  the  canons  to  eliminate  as  many  cir- 
cumstances other  than  tight  hats  as  we  can.  But  no  finite  number 
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of  observed  cases  of  tight-fitting  hats  followed  by  baldness  can  dem- 
onstrate a law*  which  is  to  hold  for  an  indefinite  number  of  cases. 
The  piinclple  of  uniformity  of  nature  does  not  help  us.  It  does  not 
say  oF  the  innumerable  casual  connections  between  phe- 

nomena are  invariable;  it  merely  asserts  that  some  are.  But  the 
task  of  the  particular  inquiry  is  to  show  that  a designated  pair  of 
phenomena  are  in  causal  relation. 

§ lO.  THE  PLURALITY  OF  CAUSES  - - 

The  method  of  agreement  is  often  found  to  be  faulty  because  we 
cannot  be  surc-so  it  is  clnimed-tlm  the  ellect  sfudica  may  not 
have  more  than  one  cause.  It  is  for  this  reason  that  the  method  of 
difference  was  regarded  by  Mill  as  a superior  experimental  proce- 
dure. The  doctrine  of  plurality  of  causes  is  stated  by  him  as  fol- 
lows: "It  is  not  true  . . , that  one  efTcct  must  be  connected  with 
only  one  cause,  or  assemblage  of  conditions;  that  each  phenomenon 
can  be  produced  only  in  one  way.  There  are  often  several  inde- 
pendent modes  in  which  the  same  phenomenon  could  have  origi- 
nated. One  fact  may  be  the  consequent  in  several  invariable  se- 
quences; it  may  follow*,  with  equal  uniformity,  any  one  of  several 
antecedents,  or  collection  of  antecedents.  Many  causes  may  produce  — 
mechanical  motion:  many  causes  may  produce  some  kinds  of  sen- 
sations: many  causes  may  produce  death.  A given  effect  may  really 
be  produced  by  a certn  in  cause,  and  yet  be  perfectly  capable  of  bang 
produced  without  it.” 14 

This  doctrine  may  be  given  a logical  version.  The  fallacy  of  af- 
firming the  consequent  in  mixed  hypothetical  syllogisms  may  be 
interpreted  as  an  illustration  of  die  doctrine  of  plurality  of  causes. 
Thus,  gNen  If  a number  expressed  in  ordinary  algorism  has  a-5  in 
the  unit'place,  it  is  divisible  by  5,  we  cannot  validly  infer  that  a 
certain  number  terminates  with  a 5 in  the  unit  place  because  it  is 
divisible  by  5;  the  number  may  terminate  with  a zero.  It  seems, 
therefore,  that  Mill  is  in  the  right  concerning  plurality  of  causes, 
and  his  doctrine  is  capable  of  being  staled  in  ,i  more  general  form 
and  in  purely  logical  terms. 

Let  us  first  consider  the  less  general  form  of  the  doctrine,  as  stated 
by  Mill.  Suppose  a house  bua ns  down.  What  is  the  cause  of  this 
event?  Perhaps  the  house  was  destroyed  because  of  an  overturned 
kerosene  lamp,  or  because  of  defective  ekciric  wiring,  or  because  of 
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a faulty  chimney*  The  reader  may  be  tempted  to  retort  that  the 
plurality  is  only  apparent.  “If  the  alleged  causes  of  the  fire  Vr'cre 
examined  more  carefully/'  he  may  perhaps  say,  "a  circumstance 
common  to  all  of  them  would  'ce  found.  For  example,  the  occur- 
rence of  a rapid  oxidaLion  in  some  part  of  the  house  is  such  a com- 
mon circumstance.  And  that  common  feature  o*  *hc  many  alleged 
causes  is  the  cause  of  the  event/' 

Such  an  analysis  is  rot  very  satisfactory.  If  the  reader  were  inves- 
tigating that  fire  for  an  insurance  company  and  submitted  such  an 
analysis,  he  would  no:  retain  hi;  post  for  long  ‘The  occurrence  of 
a rapid  oxidation/'  the  company  would  doubtless  declare,  “is  an 
explanation  of  all  fires.  It  was  not  your  job  to  discover  the  most 
general  conditions  under  which  fives  occur,  for  we  knew  that  all 
the  time;  it  was  your  job  to  find  die  special  conditions  under  which 
this  one  occurred." 

This  hypothetical  reply  of  the  insurance  company  not  only  indi- 
cates the  inadequacy  of  one  type  of  criticism  of  the  doctrine  of 
plurality  of  causes;  it  also  suggti’s  a more  complete  reply  to  the 
doctrine.  For  if  the  doctrine  were  true,  how  could  we  ever  be  able 
to  infer  the  cause  from  an  examination  of  the  ruins  of  a destroyed 
house?  There  can  be  no  doubt  that  we  frequently  infer  the  true 
cause  of  an  effect.  Fire  insurance  companies  do  so  continually.  So 
also  the  medical  examiner  is  able  to  establish  the  real  cause  of  a 
person's  death  in  spite  of  the  alleged  plurality  of  causes  of  death. 

The  more  satisfactory  reply  to  the  doctrine  of  plurality  of  causes 
is  this:  'When  a plurality  of  causes  is  assorted  for  an  effect,  the  effect 
is  not  analyzed  very'  carefully.  Instances  which  have  significant  dif- 
ferences are  taken  to  illustrate  the  seme  effect.  These  differences 
escape  the  untrained  eye,  although  they  are  noticed  by  the  expert. 
Thus  the  way  in  which  a house  burrs  down  when  an  overturned 
lamp  is  the  cause,  i_s  not  the  same  as  when  defective  wiring  is  the 
cause.  The  doctrine  of  plurality  of  causes  is  plausible  only  if  we 
analyze  the  causes  into  a larger  number  of  distinct  types  than  we 
do  the  effect.  The  doctrine  overlooks  many  differentiating  factors 
present  in  several  instances  of  a so-called  effect,  and  by  viewing  these 
instances  under  their  more  generic  features  regards  them  as  in- 
stances of  the  same  effect.  For  many  purposes  it  is  perhaps  con- 
venient to  retain  this  lack  of  symmetry  in  the  analysis  of  causes  and 
effects.  But  it  docs  not  follow  front  this  fact  of  convenience  that  the 
usual  illustrations  of  plurality  of  causes  really  prove  the  absence 
of  a one-to-one  correspondence  between  cause  and  effect. 

Let  us  now  turn  to  the  docnine  in  its  more  general  or  logical 
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form.  Must  we  deny  that  the  fallacy  of  affirming  the  consequent  Is 
a fallacy?  Not  if  we  rccognite  elementary  distinctions,  and  recall 
some  of  our  discussions  in  the  chapter  on  mathematics.  Affirming 
the  consequent  is  a fallacy  because  the  same  consequent  may  follow 
from  more  than  one  antecedent.  But,  we  may  ask,  if  a proposition 
follows  from  two  distinct  sets  of  premises,  does  it  follow  from  them 
in  virtue  of  their  being  different  from  each  other,  or  in  virtue  of 
their  containing  something  in  common? 

If  the  reader  remembers  our  discussion  of  logical  systems,  he  must 
acknowledge  that  the  second  alternative  expresses  the  true  state  of 
affairs.  We  showed  in  Chapter  VII,  § 3,  that  two  systems  may  be 
incompatible  with  each  other  taken  in  their  entirety  although 
they  may  have  many  theorems  in  common.  We  explained  this  by 
suggesting  that  the  two  systems  contain  a common  subsystem.  The 
common  theorems  of  the  two  systems  follow  strictly  from  the  axioms 
of  tlris  common  subsystem,  and  not  from  the  axioms  of  the  two 
systems  as  such. 

Let  us  state  this  in  different  words.  A set  of  premises  which  is  a 
sufficient  condition  for  a given  proposition  may  contain  conditions 
besides  those  which  are  necessary.  With,  care  and  ingenuity,  those 
portions  of  the  premises  which  are  not  required  for  the  conclusion 
can  be  eliminated.  In  this  way,  we  can  discover  all  the  conditions 
necessary  for  the  conclusion.  And  when  the  antecedent  in  an  im- 
plicative proposition  contains  the  necessary  and  sufficient  conditions 
for  the  consequent,  it  is  no  longer  a fallacy  to  argue  from  the 
affirmation  of  the  consequent  to  the  affirmation  of  the  antecedent. 

The  fallacy  of  affirming  the  consequent  is  therefore  indeed  a 
fallacy,  since  we  do  not  in  general  know  that  the  antecedent  states 
the  necessary  and  sufficient  conditions  for  the  consequent.  For  most 
purposes,  science  is  satisfied  with  the  sufficient  conditions  for  the 
propositions  it  wishes  to  establish.  But  its  goal,  which  may  never 
be  reached,  is  to  find  the  conditions  which  are  both  necessary  and 
sufficient.  

The  distinction  between  sufficient  conditions  and  those  which 
ate  both  necessary  and  sufficient  throws  further  light  on  the  limita- 
tions of  so-called  crucial  expenmen ts.  Suppose  p,  a verifiable  propo- 
sition, follows  from  theory  7\  but  not  from  Tv  Then  a slight  modi- 
fication of  Tv  one  which  leaves  its  main  outlines  unaffected,  may 
perhaps  be  made  so  that  p is  implied  by  the  revised  Tt  as  well  as 
by  Tv  Both  theories,  in  such  a case,  would  logically  contain  the 
necessary  and  sufficient  conditions  for  p,  although  they  may  con- 
tain much  else  besides.  The  verification  of  p,  therefore,  will  not 
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compel  uj  to  abandon  T,  if  we  can  continue  to  use  It  with  scien- 
tific profit  after  slightly  altering  it.  We  may  conclude  with  a per- 
tinent remark  of  Bertrand  Russell:  "a  hypothesis  which  accounts 
with  d minute  exactitude  for  all  known  relevant  fact*  must  not  be 
regarded  as  certainly  true,  since  it  is  probably  only  some  highly 
abstract  aspect  of  the  hypothesis  that  is  logically  necessary  in  the 
deductions  which  we  make  from  it  to  observable  phenomena," 11 

^•Tht  Scientific  Outlook,  1931,  p.  67, 
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CHAPTER  SIX 


CLASS  NOTES 

We  have  omitted  class  notes  for  this  chapter,  since  the  student  will 
already  have  had  a fair  number  of  exercises  in  dealing  with  measures  of 
association  in  Chapter  Two.  Further  exercises  involving  measures  of 
association,  found  in  the  class  notes  for  Chapter  Seven,  introduce  the 
problem  of  dealing  with  a thii'd  variable. 


CHAPTER  SEVEN 


ANALYSIS  OF  DATA 
I 

It  is  a commonplace  in  the  philosophy  of  science  that  facts  never 
"speak  for  themselves. " Before  facts  are  to  have  any  meaning,  they  must 
be  disentangled  and  given  order.  They  must  be  weighed  and  fitted  to  a 
relevant  body  of  ideas  or  theories. 

We  have  included  three  readings  in  this  chapter.  The  first,  "Split 
Trials  and  Time  Saving"  by  Hans  Zeisel  and  Thomas  Callahan, 
represents,  we  think,  an  excellent  example  of  a project  joining  the  law  and 
the  methods  of  research  often  found  in  the  social  sciences.  The  data  are 
analysed  in  a relatively  simple,  clear-cut  manner  with  a relatively  simple, 
specific  question  in  mind. 

The  second  selection,  "Some  Functions  of  Qualitative 
Analysis  in  Social  Research"  by  Allen  H.  Barton  and  Paul  F. 
Lazarsfeld,  ranges  more  broadly.  Fundamentally,  the  article  is  concerned 
with  the  act  of  imagination,  the  process  of  trying  "to  make  sense"  out  of  the 
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infinite  complexities  which  lie  concealed  in  the  data  dredged  up  by  empirical 
inquiry.  And  the  authors  of  the  article  implicitly  recognize  that  in  some  sense 
this  process  must  be  akin  to  artistic  creation,  just  as  in  the  case  of  formulating 
the  hypotheses  with  which  we  begin  our  research.  This  is  not  to  say  that  the 
"truth"  revealed  by  the  analysis  of  our  data  is  arbitrary  or  subjective  or  free 
from  the  canons  of  scientific  proof.  It  simply  means  that  there  is  no  one 
correct  way  to  analyse  the  data,  no  mechanical  set  of  procedures  which  must 
be  followed  in  every  instance. 

The  last  reading  is  by  William  J.  Goode  and  Paul  K.  Halt  and  it 
describes  some  methods  of  analysing  and  presenting  data,  such  as  cross- 
tabulations, frequency,  distributions,  and  so  on,  which  are  already  familiar. 

In  addition,  however,  the  authors  touch  on  a new  and  much  more  complex 
issue—namely  the  introduction  of  a third  factor  into  the  analysis  of  relation- 
ships. As  the  reader  will  see,  Goode  and  Hatt  employ  terms  such  as  interpre- 
tation, explanation,  and  specification,  which  are  part  of  the  language  of  the 
social  sciences,  to  distinguish  different  causal  interpretations.  But  the  basic 
idea  is  fundamentally  the  same:  The  relationship  between  two  characteristics 
is  frequently  modified  by  the  influence  of  a third  characteristic;  and  this  in- 
fluence must  be  statistically  controlled  or  held  constant,  if  we  arc  to  in- 
crease our  understanding. 
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With  these  remarks  in  mind,  we  can  list  some  of  the  modes  of  data 
analysis  which  are  frequently  used.  They  ai'e  very  basic  ways  of  looking  at 
data  which  crop  up  again  and  again  when  we  examine  the  results  of  research 
projects.  Much  of  the  following  material  has  already  been  covered,  of 
course,  for  the  analysis  of  data  involves  such  things  as  examining  the 
procedures  of  sampling,  the  simplification  of  data,  tests  of  significance, 
and  so  on.  We  are  listing  these  aspects  of  research  here,  however,  not 
as  steps  to  be  followed  in  the  collection  of  data  but  as  items  to  be  taken  into 
account  when  we  ask  what  our  data  mean  and  how  far  we  can  generalize  our 
results. 


1.  If  we  draw  a random  sample  from  a population  of  people  or 
of  events  or  of  some  other  type  of  unit,  we  can  make  generalizations  about 
the  population  from  which  it  comes,  as  we  have  seen.  But  can  we  make 
generalizations  to  other  similar  populations?  What  can  one  sampled  commu- 
nity, for  example,  tell  us  about  other  communities?  Will  the  conclusions 
about  a population,  based  on  a sample,  be  valid  at  other  points  In  time? 
There  is  no  definite  answer  to  these  questions — "it  all  depends,  " we  say — 
but  they  are  not  questions  that  can  be  avoided  in  the  analysis  of  data. 


It  is  a very  common  practice  to  extetid  the  conclusions  of  a study 


beyond  the  population  which  has  been  examined.  Either  implicitly  or  explicitly 
it  is  frequently  argued  that  the  conclusions  drawn  from  a study  do  have  implica 
tions  that  go  beyond  the  immediate  case  in  hand.  But  this  is  not  an  assertion 
that  can  be  made  blindly — we  must  note  the  similarities  and  differences  of  the 
population  studied  in  comparison  with  the  larger  set  of  populations  which  con- 
cern us  and  come  to  a reasoned  judgment  of  how  these  similarities  and 
differences  will  influence  our  generalizations. 

Still,  we  must  make  sure  our  starting  point  is  a strong  one--that  is, 
we  must  be  as  certain  as  we  can  that  one  sample  studied  is  representative 
of  its  universe.  One  of  the  first  things  to  do,  then,  in  analysing  data,  is  to 
examine  carefully  and  critically  the  sampling  procedure  used. 

2.  Even  a random  sample,  however,  may  not  be  representative. 
This  is  a point  to  be  checked,  if  at  all  possible,  by  comparing  the  character- 
istics of  the  sample  with  whatever  information  is  available  about  the  universe 
from  which  it  comes.  Census  materials,  official  records,  previous  surveys — 
all  may  provide  summary  information  about  the  population  which  can  be  used 
to  gauge  how  close  the  particular  sample  of  a particular  study  comes  to 
mirroring  the  parent  population. 

3.  In  the  material  on  the  collection  of  data,  it  was  pointed  out 
that  there  are  many  steps  between  empirical  reality  and  the  records  that 
end  up  in  research  project  files.  When  we  examined  the  so-called  instru- 
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ments  used  by  the  social  scientist  to  collect  data — interview  schedules,  un- 
structured questionnaires,  protocols  for  recording  non-verbal  behavior,  etc. 
— we  must  remember  to  ask  if  alternative  methods  of  data  collection  would 
produce  the  same  results.  Even  with  one  particular  method,  such  as  an  inter- 
view schedule  used  in  a survey,  important  changes  in  the  end  product  can 
occur  with  changes  in  the  wording  of  questions,  the  order  In  which  they  are 
asked,  and  so  on. 

4.  We  have  continually  stressed  the  concept  of  relationship  in 
these  materials,  arguing  that  the  concept  was  the  very  stuff  of  which 
scientific  reasoning  is  made.  One  of  the  most  important  types  of  relation- 
ships involves  the  covariation  of  two  variables,  for  it  provides  a model  for 
much  of  our  thinking  about  relationships  between  characteristics  in  the  form 
of  attributes.  When,  however,  we  are  actually  dealing  with  variables  rather 
than  characteristics  such  as  attributes,  in  looking  at  the  data  accumulated  in 
a particular  project,  we  must  be  very  sure  that  certain  assumptions  under- 
lying our  calculations  are  valid.  One  of  the  most  important  of  these  is  the 
difference  between  rectilinear  and  curvilinear  relationships  discussed  in 
some  detail  in  the  readings  in  the  chapter  on  correlation.  If  we  make  a 
mistake  on  this  point,  we  can  seriously  distort  the  reality  embedded  in  our 
data. 
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5.  As  indicated  earlier,  we  often  find  it  convenient  to  simplify  our 
data  when  we  wish  to  get  at  the  underlying  patterns  of  relationships.  Some- 
times this  is  a matter  of  calculating  means  and  standard  deviations,  per- 
centages, and  so  on.  However,  it  may  also  involve  converting  a variable 

or  a set  of  ranked  categories  into  a dichotomous  attribute.  The  problem 
here  is  one  of  deciding  on  the  cutting  point — and  although  there  are  no  hard- 
and-fast  rules  on  this  issue,  we  often  try  to  break  the  frequency  distribution 
approximately  in  half.  The  dichotomizing  of  variables  into  attributes  means 
that  we  lose  a good  deal  of  detail  in  our  data;  but  it  also  may  mean  that  the 
analysis  is  a good  deal  less  complex. 

6.  We  often  find  that  characterizing  a unit  of  observation  in 
terms  of  one  characteristic  at  a time  is  not  as  revealing  as  we  would  like 
it  to  be.  We  need  to  construct  scales,  indices,  and  other  forms  of  scoring 
devices,  which  are  really  ways  to  characterize  units  of  observation  with  a 
great  deal  of  precision  along  a single  dimension.  This  issue  is  beyond  the 
scope  of  our  introduction  to  research  methods  in  the  social  sciences,  but 
the  student  should  at  least  be  familial  with  one  elementary  form  of  dealing 
with  multiple  characteristics,  namely,  the  construction  of  typologies  based 
on  two  dichotomized  attributes.  Wo  may,  for  example,  in  analysing  data, 
classify  individual^  into  "high’'  or  "low"  in  terms  of  two  characteristics; 
and  from  the  resulting  2x2  table  we  can  construct  a new  classification  of 
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individuals— a typology— based  on  the  joint  occurrence  of  the  attributes.  Thus, 
T^pe  1 would  be  "High-High1';  Type  II  "High-Low";  Type  III  "Low-High";  ard 
Type  IV  "Low-Low.  " The  usefulness  of  this  simple  procedure  is  well  present- 
ed in  the  reading  by  Barton  and  Lazarsfeld. 

7.  Once  we  have  established  the  characteristics  we  wish  to 
examine  (variables,  attributes,  types,  etc.)  our  next  task  is  to  examine  their 
inter-relation3hips.  An  important  part  of  this  task  involves  checking  the 
nature  of  this  relationship,  i.e.  , is  it  curvilinear  or  rectilinear?  In 
addition,  however,  we  are  concerned  with  the  extent  of  the  relationship — 
and  for  this  we  need  some  form  of  measurement.  The  readings  have  touched 
on  two  forms  of  measurement  which  are  frequently  encountered  in  the 
analysis  of  data:  Yule's  Q for  attributes  and  Pearsonian  Product-Moment 

r for  variables.  The  student  should  be  familiar  with  the  interpretation  of 
these  measures,  since  he  will  inevitably  encounter  them  in  the  literature 
of  the  social  sciences. 

8.  The  need  to  sample  a population,  rather  than  examining  every 
unit  it  contains,  has  been  argued  before  at  some  length.  And,  as  we  have 
pointed  out,  the  sample  gives  our  best  estimate  about  the  population  or 
universe  from  which  it  comes.  We  are  still  faced  with  the  task  of  judging 
the  accuracy  of  our  estimate,  however,  and  this  involves  establishing  confi- 
dence limits  and  making  tests  for  the  significance  of  differences.  The 
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important  point  to  remember,  in  the  analysis  of  data,  is  that  as  valuable  as 
a sample  may  be,  it  cannot  be  taken  at  face  value.  We  must  logically  work 
out  the  limits  of  its  accuracy. 

9.  The  examination  of  relationships  may  indeed  by  the  stuff  of 
which  science  is  made,  as  we  have  argued,  but  it  is  clear  that  the  com- 
plexities of  the  empirical  world  demand  more  than  the  examination  of  two 
characteristics  at  a time.  The  task  of  ascertaining  the  influence  of  other 
characteristics  can  grow  extremely  difficult  and  complex,  but  we  can  limit 
ourselves  to  three  forms  which  are  fundamental. 

First,  we  can  ask  if  an  observed  relationship  can  be  accounted  for 
by  an  Intervening  variable,  in  the  sense  that  the  Independent  variable  causes 
the  intervening  variable  (Goode  and  Hatt  refer  to  the  intervening  variable  as 
the  test  variable)  which  in  turn  causes  the  dependent  variable.  If  we  can 
show  (1)  that  the  Independent  variable  is  related  to  the  Intervening  variable; 

(2)  that  the  intervening  variable  is  related  to  the  dependent  variable;  and 

(3)  that  the  relationship  between  the  independent  and  dependent  variable  is 
much  diminished  or  disappears  when  the  intervening  variable  is  held  con- 
stant, we  have  provided  evidence  for  our  theory.  (Goode  and  Hatt  give  us 
an  illustration  with  attendance  at  a private  or  public  secondary  school  as 
the  independent  variable,  secondary  school  grades  as  the  intervening 
variable,  and  grades  in  college  as  the  dependent  variable.) 
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Second,  we  can  ask  if  an  observed  relationship  can  be  accounted  for 
by  an  antecedent  variable,  in  the  sense  that  two  chax'acteristics  are  related 
not  because  one  causes  the  other,  directly  or  indirectly,  but  because  both 
are  caused  by  a third  variable — the  so-called  antecedent  variable.  The 
statistical  manipulation  is  the  same  as  in  the  case  where  we  try  to  determine 
the  influence  of  an  intervening  variable — we  hold  the  third  variable,  i.e. , the 
antecedent  variable,  constant  and  see  if  ihe  observed  relationship  disappears. 
(Goode  and  Halt  use  economic  status  and  birth  rate  as  independent  and 
dependent  variables  respectively,  and  education  as  a possible  antecedent 
variable. ) 

Third,  we  can  analyse  our  data  to  see  if  an  observed  relationship 
between  two  characteristics  increases,  decreases,  or  remains  the  same 
when  a third  characteristic  is  held  constant.  That  is  to  say,  we  measure 
the  obsei’ved  relationship  for  all  those  cases  in  which  the  third  character- 
istic is  present  and  we  measure  the  same  relationship  for  all  those  cases 
in  which  the  third  characteristic  is  absent.  This  examination  of  the  relation- 
ship under  varying  conditions  is  referred  to  as  specification ; and  Goode  and 
Halt  use  as  an  illustrction  the  relationship  between  education  and  rank  in  the 
Army  under  conditions  of  long-term  or  short-term  service.  Again,  the  con- 
struction of  the  necessary  tables  is  the  same  as  in  the  two  previous  types  of 
analysis,  although  there  Is  a difference  in  the  nature  of  the  theoretical  issues 
which  arc  being  demonstrated. 
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With  this  chapter  on  the  analysis  of  data,  we  come  to  the  end  of  our 
introduction  to  the  ’onion  of  law  and  social  science  research.  We  have  tried 
to  stress  what  we  think  are  the  most  fundamental  ideas  and  we  have  tried  to 
avoid  a too  narrow  concern  with  specific  techniques.  But  these  materials 
are  not  viewed  as  some  definitive  set  of  readings  which  must  be  covered  in 
a rigid  and  exhaustive  fashion.  Teaching  materials  such  as  these,  we  believe, 
must  be  used  in  a flexible  manner,  with  the  teacher  and  students  adapting 
them  to  their  particular  interests  and  background. 

In  our  experience  with  using  these  materials  in  the  law  school,  we 
have  found  that  law  students  are  usually  more  concerned  with  the  substantive 
issues — at  least  initially — than  with  the  methodological  issues.  This  Is 
understandable  enough;  and,  in  fact,  we  think  that  teaching  social  science 
methodology  to  law  students  successfully  requires  a continuous  emphasis 
on  substantive  application.  At  the  same  time,  we  think  it  is  worthwhile  to 
reiterate  the  basic  argument  on  which  this  collection  of  readings  is  founded-- 
namely,  that  the  Judicious  use  of  substantive  material  drawn  from  the  social 
sciences  is  dependent  on  at  least  an  elementary  understanding  of  the  research 
methods  by  which  such  material  is  assembled.  If  this  collection  of  readings 
aids  the  law  student  in  achieving  that  understanding,  we  will  have  achieved 
our  purpose. 
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SIMPLE  TECHNIQUES  AND  IMPORTANT  RESULTS 


Hans  Zeisel  and  Thomas  Callahan  were  asked  by  the  District 
Court  for  the  Northern  District  of  Illinois  to  study  the  time  saved 
by  splitting  trials  into  two  separate  proceedings — one  to  determine 
liability  and  one  to  determine  damages.  We  think  that  their  research 
provides  an  excellent  example  of  the  practical  value  in  social  science 
in  solving  pressing  legal  problems.  There  are  two  points  particularly 
worth  noting:  (1)  the  research  design  is  an  approximation  of  a true 
experiment;  and,  (2)  the  authors  use  relatively  simple  mathematics 
and  yet  produce  important  results. 

In  discussing  random  sampling  procedures,  Mueller  and 
Schuessler  in  Chapter  Four  make  the  point  that  the  "ideal  sampling 
procedure  is  one  in  which  the  drawings  are  aifected  by  impartial 
chance  factors  alone."  This,  of  course,  is  a device  to  insure  that 
the  samples  present  an  accurate  representation  of  the  universe  under 
observation.  Zeisel  and  Callal  an,  due  to  the  technicalities  sur- 
rounding the  adoption  of  a judicial  rule,  could  not  use  random 
selection  procedures.  Rather,  they  had  to  rely  upon  data  "routinely 
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collected  by  the  Office  of  United  States  Courts.  " As  a result,  we 
cannot  be  sure  if  the  cases  tried  by  one  method  are  the  same  as 
those  cases  tried  by  another — and,  as  a consequence,  we  cannot 
be  sure  if  any  difference  in  outcome  is  due  to  the  difference  in  the 
method  of  trial  or  some  other  factor.  The  student  should  note  how 
the  authors  come  to  grip  with  this  problem. 

As  for  the  use  of  simple  mathematics,  the  reading  makes  it 
clear  that  the  law  student  can  engago  in  important  research  without 
necessarily  using  formulas  for  correlation,  tests  for  significance, 
etc.  In  fact,  the  authors  rely  heavily  on  simple  percentages  and 
simple  logic — but  the  methods  fit  their  purpose  and  the  results  are 
more  than  adequate. 


From  Hans  Zeisel  and  Thomas  Callahan,  "Split  Trials 
and  Time  Saving:  A Statistical  Analysis,"  Harvard 
Law  Review,  76  (1963),  pp.  1606-1625.  Reprinted 
by  permission. 
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Hans  Zeisel*  and  Thomas  Callahan**, 
' "Split  Trials  and  Time  Saving:  A 

Statistical  Analysis,"  Harvard  Law 
Review.  76  (1963),  pp  1606>1625. 


In  the  United  States  District  Court  for  the  Northern  District 
of  Illinois  adopted  a ride  permitting  separate  trials  of  liability  and 
damage  issues  in  civil  cases.  At  the  request  *f  the  court  the  authors 
hove  conducted  an  investigation  to  determine  the  extent  to  which 
time  is  saved  through  use  of  the  separation  device.  They  use  a 
variety  of  converging  statistical  approaches  to  arrive  at  the  conclu- 
sion that  in  personal  injury  jury  trial  cases  about  twenty  per  cent 
of  trial  time  may  be  saved.  They  then  consider  the  possible  ways 
that  separation  might  lengthen  trial  time , and  conclude  that  these 
will  not  offset  the  time  saving  to  any  degree. 

Among  the  devices  aimed  at  reducing  live  trial  load  of  our 
courts,  the  split  or  separate  trial  of  issues  has  been  proposed  as 
holding  special  promise.  Trial  time  would  be  saved,  the  argument 
goes,  if  the  issues  of  liability  and  damages  were  tried  separately. 
The  jury  would  be  required  first  to  bring  in  a verdict  on  liability, 
and  only  if  liability  were  found  would  the  jury  hear  evidence  and 
render  a second  verdict  on  the  issue  of  damages.1  On  the  recom- 
mendation of  the  late  Judge  Julius  H.  Miner,®  the  United  States 
District  Court  for  the  Northern  District  of  Illinois  was  persuaded 
to  test  this  device,  and  on  November  3,  1959,  adopted  civil  rule 
21,  set  forth  below,3  permitting  separate  trial  on  motion  of  a party 

t We  arc  happy  to  acknowledge  the  generous  support  of  Ibis  study  by  the  Wal- 
ter E,  Meyer  Research  Institute  of  Law*  This  Investigation,  an  offshoot  of  the 
University  of  Chicago  Law  School's  prior  work  on  the  jury  system  ar.d  court 
congestion!  was  conducted  In  closest  collaboration  with  Professor  ffariy  Kalven, 
Jr,,  Director  of  the  Jury  Project* 

♦ Professor  of  Law  and  Sociology,  University  of  Chicago  Law  School*  Dr.  jur., 
University  of  Vienna,  1927,  Dr.  rer.  pot,  1928. 

••Third-year  student.  University  of  Chicago  Law  School;  Research  Assistant, 
Jury  Project,  since  1957,  M.A.,  University  of  Chicago,  196 a. 

'Such  a plan  was  proposed  in  Zeisel,  Kilven  1 Pucunoiz,  Delay  is  toe 
Court  99  (1959)  [hereinafter  cited  as  Delay]. 

1 Miner,  Court  Congath..:  A New  Approach,  4s  A.BA.J.  1265,  n63  (1959). 
Judge  Miner  died  on  March  si,  1963,  the  very  day  this  report  was  submitted  to 
his  court.  The  cause  of  court  administration,  and  especially  Us  students,  owe  him 
a great  debt  cf  gratitude  for  bis  determined  efforts  to  pioneer  what  promises  to  be 
a uniquel>  powerful  delay  remedy. 

3 N.D.  III.  (Crv.)  R.  ?i,  set  forth  In  2 Fin.  Rules  5erv\  ao  104P-49  (1960): 
Pursuant  to  and  in  furtherance  of  Rule  42(b),  Federal  Rules  of  Civil  Pro- 
cedure, to  curtail  undue  delay  In  the  kdmlnlstr.ulon  of  justice  in  personal  injury 
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or  at  the  disection  of  the  court.  Though  the  rule  was  sponsored  in 
order  to  save  trial  time,  a cc  "ledity  in  short  supply  in  most  met- 
ropolitan courts,  attempts  \\ ere  soon  made  to  upset  it  on  the 
ground  that  this  major  departure  from  the  traditional  mode  of 
trial  might  affect  the  substance  of  the  verdicts  rendered  by  juries. 
Bu;  the  Court  of  Appeals  for  the  Seventh  Circuit  held  that  the  rule 
preserved  the  essential  character  of  trial  by  jury  and  was  within 
the  ambit  of  Rule  42(b)  of  the  Federal  Rules  of  Civil  Procedure, 
which  permits  separated  trial  of  issues  in  the  interest  of  conven- 
ience.4 

Prior  to  the  promulgation  of  the  rule,  Chief  Judge  Campbell 
asked  the  Law  School  of  the  University  of  Chicago  to  design  a re- 
search operation  that  would  measure  the  effect  of  separation  on 
the  trial  load  of  the  court.6  A later  report  may  deal  with  the 
above-mentioned  substantive  considerations;  the  present  report 
treats  only  of  one  aspect  of  the  rule  — its  impact  on  trial  time. 

The  way  in  which  separation  of  issues  may  save  court  time  is 
clear.  In  the  traditional  form  of  trial,  the  damage  issue  must  be 
litigated  even  where  tie  verdict  will  ultimately  reject  liability; 
separation  would  eliminate  the  need  for  trying  the  damage  issue  in 
those  cases,  comprising  rougdily  40  per  cent  of  all  personal  injury 
jury  trials.®  Even  where  liability  was  affirmed,  it  seemed  likely 

and  other  civil  litigation  wherein  the  Issue  of  liability  may  be  adjudicated  as 
a prerequisite  to  the  determination  of  any  or  all  other  Issues,  in  jury  and 
non-jury  cases,  a separate  trial  may  be  had  upon  such  issue  of  liability,  upon 
motion  of  nny  of  the  parties  or  at  the  Court's  direction,  In  any  claim,  cross- 
claim,  counterclaim  or  third-party  claim. 

In  the  event  liability  Is  sustained,  the  Court  may  recess  for  pre-trial  or 
settlement  conference  or  proceed  with  the  trial  on  any  or  all  of  the  remaining 
issues  before  the  Court,  before  the  same  jury  or  before  another  Jury  as  con- 
ditions may  require  and  the  Court  shall  deem  meet. 

The  Court,  however,  may  proceed  to  trial  upon  all  or  nny  combination 
of  issues  if,  in  its  discretion,  and  in  furtherance  of  Justice,  ft  shall  appear  that 
a separate  tin!  will  work  a hardship  upon  any  of  the  parties  or  will  result  fn 
protracted  or  costly  litigation. 

4Ho*ic  v.  Chicago  & NAV.  Ry , 2S2  F.zd  639  Olh  Cir.  i960),  ccrt.  denied,  365 
U S.  Sr*  (1961),  The  court  did  not  decide  whether  the  use  of  different  juries  for 
liability  and  damage  Issues  ns  authorized  by  the  district  court  rule  violates  the 
*cvcn4h  amendment.  The  district  court  has  confined  the  operation  of  rule  21  to 
single  juries.  C/.  O’Donnell  \\  Watson  Bros.  Tramp.  Co.,  183  F.  Supp.  577  (N.D. 
HI.  i960). 

5 Throughout  the  study  Judges  Campbell,  Miner,  and  Robson  acted  as  advisers, 
for  which  \vc  owe  them  a great  debt  of  gratitude.  Wc  also  want  to  thank  the  other 
Members  of  the  court  and  Its  clerks  for  their  meticulous  help  throughout  these  two 
years.  Wc  arc  ecpcchlly  indebted  to  Mr.  Roy  Johnson,  Special  Assistant  to  Judge 
Campbell,  and  to  Miss  AHccnc  C,  Coldby,  Chief  Deputy  Clerk,  for  their  untiring 
cooperation.  Appreciation  Is  also  due  to  Mr.  Oiln  S,  Thiel  and  Mr.  Ronald  H. 
bf  atty  cf  the  Administrative  Office  of  the  United  States  Courts  for  their  thought* 
f d help  in  supplying  us  with  a series  of  control  statistics  from  tlielr  office, 

* Trom  a nationwide  sample  of  civil  Jury  trials,  which  forms  part  of  the  basic 
of  the  Jury  Project  of  the  University  of  Chicago  Daw  School. 
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that  trial  time  might  be  saved  by  the  stimulus  to  settlement 
before  the  damage  issue  is  litigated.  There  remained,  however,  the 
possibility  that  these  savings  might  well  be  offset  by  a number  of 
countervailing  factors  not  so  immediately  obvious.  Thus  the 
separation  of  issues  might  reduce  the  proportion  of  cases  settled, 
and  thereby  increase  the  number  cf  cases  requiring  trial.  It  might 
reduce  the  ratio  of  jury  waiters  and  thereby  add  to  the  number  of 
cases  requiring  jury  tria!.T  Separation  might  increase  the  propor- 
tion of  hung  juries,  and  necessitate  more  retrials.  Finally,  it  might 
simply  require  more  time  of  juries  overall,  since  in  some  cases 
there  would  he  two  deliberations.  The  crucial  question  for  this 
study  was,  therefore,  not  so  much  whether  but  rather  how  much 
time  would  be  saved  by  the  separation  of  issues. 

Because  our  inquiry  was  set  up  only  shortly  before  the  adop- 
tion of  the  separation  rule,  we  were  unable  to  acquire  data  on  the 
period  before  the  rule  which  we  should  have  wished  to  have  in 
order  to  compare  the  operations  of  the.  court  before  and  after  the 
separation  rule  was  introduced.  Instead,  for  our  prior  control 
period  we  had  to  rely  for  information  on  data  routinely  collected 
by  the  statistical  branch  of  the  Administrative  Office  of  the  United 
Stales  Courts.  The  ideal  research  design  would  have  combined 
such  a prior  observation  period  with  a selection  by  chance  of  the 
cases  to  which  the  separation  rule  would  be  applied.  Specifically, 
one  might  have  subjected  every  three  successive  cases  filed  to  a 
sort  of  lottery:  in  the  case  drawn  first,  separation  would  be  re- 
quired; in  the  second,  separation  would  not  be  allowed;  in  the 
third  case,  separation  would  be  left  to  the  judge’s  discretion. 
These  groups  of  three  would  be  assigned  in  rotation,  giving  each 
judge  an  equal  number  of  cases  of  each  variant.  After  final  dis- 
position, the  separated,  the  regular,  and  the  cases  where  separa- 
tion was  discretionary  would  be  compared  and  measured  against 
the  period  that  preceded  the  introduction  of  the  rule.  But  this 
division  of  cases  was  of  course  not  feasible  for  a variety  of  good 
reasons.8  Instead,  all  that  could  be  done  was  to  keep  a careful 
record  of  the  court’s  actions  in  each  case  from  assignment  to  final 
disposition,  both  for  cases  in  which  there  was  separation  and  for 
the  regular  trials.  We  had  no  influence  and  of  course  did  not  at- 
tempt to  exert  any  influence  on  the  choice  of  cases  in  which  the 

’The  New  York  experience  showed  that  roughly  40^0  less  time  fs  used  up  In  a 
bench  trial.  Dklay  81. 

*C).  The  Case  for  the  Official  Experiment,  Dexay  841-50;  Zelsel,  The  .Year 
Fort  Expert  Testimony  Project:  Some  Reflections  on  legal  Experiments,  S Stan. 
L.  Rtv.  750,  ?4t--i* *  ('95<). 
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Issues  were  separated.  Each  judge  applied  the  broad  discretion 
conferred  on  him  by  the  rule  according  to  his  own  understanding 
of  its  suitability"  for  the  case  at  hand. 

I 

To  begin  the  analysis,  then,  we  present  a synopsis  of  the  civil 
cases  tried  before  the  court.  Table  i gives  the  proportion  of  cases 
tried  under  the  separation  rule  in  each  category.  The  table  shows 
that  the  relative  frequency  of  such  trials  varies  greatly  with  the 
type  of  action.  Two  categories  show  no  separation  whatsoever: 
admiralty  cases  and  tort  cases  other  than  personal  injury  claims. 
The  greatest  use  of  separation  was  made  in  personal  injury  trials: 
here  37  per  cent  of  the  jury  cases  and  27  per  cent  of  the  bench 
trials  were  tried  under  the  separation  rule.  Separation  was  ordered 


TABLE  1 

Relative  Frequency  of  Separated  Trials  uy  Type  of  Case  1 


Regular  Trials 
Kumber 

Separated  Trials 

Percentage 

dumber  cf  Total 

TOTAL 

Personal  Injury: b 

Jury 

U7 

69 

37% 

1S6 

Non  jury 

22 

3 

27% 

_3 0 

216 

Other  Tort: 

Jury 

10 

, . 

10 

Non  jury 

4 

4 

14 

Contract: 

Jury 

19 

6 

24% 

25 

Nonjury 

17 

2 

11% 

19 

44 

Federal  Tort 

Claims  Act: 

Nonjury 

10 

4 

*9% 

14 

14 

Admiralty: 

Jury 

1 

, . 

1 

Nonjury 

6 

. . 

6 

* 

7 

Other: 

Jury 

3 

8 

Nonjury 

34 

4 

Ilfo 

_38 

46 

TOTAL: 

Jury 

155 

75 

33% 

230 

Nonjury 

93 

18 

:6% 

III 

24S 

03 

27% 

341 

1 For  the  two-year  period  after  introduction  of  rule  n (1950-1961). 

1 Includes  :6  Juiy  and  6 nonjury  FiXA  cares,  of  which  one  Jury  (rht  was 
nparated. 
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in  only  one  ovit  of  twenty-two  FELA  crises  but  in  29  per  cent  of 
the  fourteen  Federal  Tort  Claims  Act  cases.  These  frequencies 
forced  us  to  limit  our  inquiry  to  personal  injury  cases:  only  this 
category  provided  a sufficiently  large  group  of  separated  (69)  and 
regular  (117)  trials  to  offer  an  adequate  basis  for  statistical  com- 
parison. It  would  be  improper  to  compare,  for  instance,  separated 
ton  cases  with  nonseparated  contract  cases,  since  the  validity  of 
any  comparison  requires  that  the  cases  be  kept  as  homogeneous 
as  possible.  Strictly  speaking,  therefore,  conclusions  of  the  study 
will  apply  only  to  personal  injury  jury  cases,  but  within  reason 
they  should  prove  applicable  also  to  other  types  of  cases. 

Table  2 proceeds  to  the  comparison  of  separated  and  regular 
personal  injury  trials.  The  1S6  trials  by  jury  during  the  obser- 
vation period  are  divided  by  the  stage  at  which  they  terminated. 
We  distinguish  four  stages  for  the  separated  trials  and  four 
roughly  parallel  stages  for  the  regular  trials.  The  midpoint 
for  the  regular  trial  comes  at  the  end  of  the  plaintiff’s  case;  for 
the  separated  trial,  at  the  end  of  the  liability  trial. 

TABLE  2 

Starks  of  Disposition  and  Time  Required  To  Reach  Them  1 


Separated  Trials  Rt’«u!ar  Trials 


5las<  of  DI?posU3iin 

Days 

Ter  Cent 

Sutge  of  Deposition 

Day> 

Per  Ccnl 

During  l iability 

During  Plaintiff’s 

Trial 

1.9 

19% 

Case 

2.0 

1 8% 

At  End  of  Liability 

At  End  of  Plaintiff’s 

Trial 

3-2 

62% 

Case 

3-3 

4% 

During  Damage 

During  Defendant’ 

s 

Trial 

3-6 

4% 

Case 

After  Full  Trial 

4-0 

157° 

Alter  Tull  Trial 

4-7 

7 $7 

I Average  All 

Average  All 

Cases 

3.1 

100% 

Cases 

4.2 

ICO^ 

Number  of  Cases  (69)  Number  of  Cases  (117) 

•Persona)  injury  jury  trials.  Only  court  time  is  considered;  deliberation  time  is 
not  Included.  Figures  in  the  “Days”  column  arc  averages  for  cases  disposed  of  at 
the  stage  indicated. 

It  was,  to  repeat,  the  expectation  of  those  who  favored  the 
new  rule  that  many  separated  trials  would  end  after  the  liability 
verdict,  thus  making  the  damage  trial  unnecessary.  As  can  be 
seen,  Table  2 vividly  confirms  this  expectation:  fully  78  per  cent 
of  the  regular  trials,  but  only  15  per  cent  of  the  separated  trials, 
go  through  their  full  course.  And  since  many  more  regular  trials 
run  their  full  course  than  do  separated  trials,  it  must  be  expected 


D'' 
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that  the  average  trial  time  for  each  is  quite  different.  The  figures 
for  trial  time  in  Table  2 bear  this  out,  showing  that  the  average 
regular  tnal  lasts  4.2  days,  the  separated  trial,  3.1  days— -a  time 
difference  of  26  per  cent.  It  would  be  tempting  to  read  this  figure 
as  the  looked-fo”  measure  of  the  lime  saved  by  separation.  Un- 
fortunately, it  cannot  be  accepted  at  face  value  because,  standing 
by  itself,  this  comparison  of  the  separated  and  the  regular  trials 
could  be  quite  misleading. 


II 

At  this  point  we  ask  for  the  reader’s  indulgence  during  a some- 
what complicated  methodological  excursion  that  will,  we  trust, 
make  clear  why  without  further  investigation  we  cannot  trust 
the  comparison  made  by  Table  2.  The  precise  question  is  this: 
Were  the  cases  selected  for  separated  trial  and  the  cases  left  for 
regular  trial  substantially  similar  except  for  the  fact  of  separation? 
If  they  were,  we  may  attribute  the  observed  difference  to  the  fact 
of  separation.  But  if  they  were  not,  we  are  in  trouble.  It  is  the 
same  methodological  problem  that  came  to  the  fore  in  the  now 
famous  debate  over  whether  smoking  shortens  our  life  expectancy. 
The  average  age  at  which  smokers  die  is  lower  than  that  of  non- 
smokers,  just  as  the  average  separated  trial  is  shorter  than  the 
average  nonseparated  one.  Does  such  a statistic  prove  that  smok- 
ing shortens  life,  or  that  separation  shortens  trial  time?  Suppose 
it  is  the  man  who  is  less  healthy  to  begin  with  who  takes  up  smok- 
ing. We  would  then  falsely  attribute  to  smoking  the  effect  of 
shortening  our  lifetime,  when  in  fact  the  average  smoker  had 
a shorter  life  expectancy  even  before  he  began  smoking.  Simi- 
larly, one  may  find  that  deaths  are  more  likely  to  occur  in  the  near 
future  in  families  recently  visited  by  physicians  than  in  those  un- 
attended. Obviously  it  is  not  the  doctor,  but  the  preexisting  situa- 
tion that  calls  for  a doctor,  which  causes  (he  higher  mortality 
rate.  A similar  difficulty  may  apply  to  the  figures  in  Table  2: 
cases  selected  for  separate  trial  might  be  different  to  begin  with. 
If  that  difference-  is  itself  related  to  the  time  saving  we  want 
measured,  then  wo  no  longer  know  to  what  extent  the  apparent 
time  saving  was  due  to  differences  other  than  separation.  That 
was  why  our  ideal  research  design  specified  selection  by  lot  — 
by  avoiding  purposeful  selections  wc  could  avoid  the  possibility 
of  selections  on  some  basis  which  might  have  a bearing  on  trial 
time. 

Wcic  our  separated  trials  selected  in  any  way  that  could  lessen 
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the  validity  of  the  comparison  in  Table  2?  We  know  that  there 
was  purposeful  selection  by  the  judges.  Though  we  do  not  know 
the.  basis  for  their  selection,  a few  statistics  give  us  warning. 
From  Table  2,  we  see  that  t!ie  average  separated  trial  which  actu- 
ally went  to  the  jury  on  the  damage  question  took  4.0  days;  but 
it  took  considerably  longer  — 4.7  days- — to  try  the  regular  cases 
that  ran  their  full  course.9  There  is  thus  some  indication,  though 
no  proof,  that  the  cases  differ  by  selection.  There  is  no  proof, 
because  the  figure  for  separated  trials  is  based  on  only  a small 
fraction  of  cases  that  go  the  full  course;  the  potentially  longer 
cases  might  well  have  been  disposed  of  during  trial.  On  this  view 
the  difference  would  disappear  if  all  trials  were  full  trials. 

Further,  the  figures  in  Table  3 suggest  that  the  two  groups  of 
cases  were  not  selected  at  random.  Only  .3  4 per  cent  of  the  regu- 
lar trials  ended  in  favor  of  the  defendant,  in  contrast  to  56  per 
cent  of  the  separated  trials  - a figure  far  above  the  average.10  Note 
too  that  the  proportion  of  directed  verdicts  is  larger  in  the  sep- 
arated group.  Finally,  there  is  evidence  that  the  regular  trials  left 
over  after  some  cases  have  been  removed  for  separated  trials 
differ  as  a group  from  the  run  of  regular  trials  before  civil  rule 
21,  when  no  cases  were  separated.  Statistics  supplied  us  by  the 
statistical  branch  of  the  Administrative  Office  of  the  United  States 
Courts  indicate  that  of  the  196  personal  injury  jury  trials  con- 
ducted in  this  court  during  the  two  years  prior  to  the  introduction 


TABLE  3 

Outcome  and  Mode  of  Termination  of  Separated  and 
RecularTriais* 


Mode  of  Termination 

Separated  Trlils 
Per  Cent 

Regular  Trials 
Per  Cent 

Settlement 

32 

*4 

Jury  Verdict  for  Plaintiff 

12 

Total  in  Favor  of  Plaintiff 

44 

CG 

Jury  Verdict  for  Defendant 

43 

3^ 

Directed  Verdict  for  Defendant 

13 

3 

. Total  in  Favor  of  Defendant 

5G 

34 

Number  of  Trials 

(69) 

("7) 

•Personal  injury  Jury  trials. 

•The  trial  times  for  the  regular  cases  are  somewhat  high  compared  with  statis- 
tics available  from  other  courts,  but  there  are  no  grounds  for  suspecting  that  this 
should  restrict  the  validity  of  our  finding?.  Wc  arc  concerned  only  with  the  rela- 
tive trial  time  of  separated  and  regular  trials,  and  this  relation  should  not  be 
affected  by  the  g^nerady  longer  trial  time. 

10  Sec  note  6 supra. 


Hans  Zfti6el  and  Thomas  Callahan 


of  rule  21,  only  62  per  cent  of  tlje  cases  went  to  a jury  verdict, 
as  against  73  per  cent 11  of  the  regular  trials  after  rule  21.  This 
is  another  suggestion  that  judges  might  have  been  more  likely 
to  order  separate  trial  for  those  cases  which  held  a promise  of 
not  going  through  a full  trial 


III 

So  much  for  the  potential  infirmities  of  the  data  in  Table  2.  Do 
our  data  permit  us  to  move  the  analysis  to  more  secure  ground? 
Table  4 will  show  the  proportion  of  personal  injury  jury  trials 
tried  by  each  judge  under  the  separation  rule:  the  ratio  varies 
from  S9  per  cent  for  Judge  A down  to  zero  for  Judges  K and  L. 
Though  this  varying  use  seems  at  first  blush  unfortunate,  it  is 
this  very  variation  which  will  help  to  solve  the  problem.  For 
clarification,  we  return  to  the  smoking  analogy.  Just  as  we  could 
not  dictate  when  judges  were  to  use  split-trial  procedure,  we  could 
not  select  youngsters  at  random,  ordering  one  group  to  smoke 
and  another  not  to.  But  we  can  expose  one  random  group  of 
smokers  to  “step-smoking”  propaganda  and  keep  another  random 
group  of  smokers  as  a control  group,  unexposed  to  this  propa- 
ganda. We  could  then  compare  the  longevity  of  the  “exposed” 
group  with  that  of  the  control  group.  Any  difference  could  be 
attributed  to  the  “stop-smoking”  campaign. 

Let  us  try  to  see  more  precisely  how  this  analytical  procedure 
bypasses  the  difficulty  that  the  decision  to  stop  smoking  will  not 
be  made  at  random,  but  more  likely  by  persons  who  arc  in  some 
way  special  with  respect  to  smoking  and  with  respect  to  health 
in  general.  The  point  is  that  in  this  design  we  neither  know  nor 
need  to  know  how  the  self-selection  works. 

Let  us  assume  the  worst  possible  case  from  the  point  of  view 
1 of  selection  bias:  suppose  the  only  smokers  who  respond  to  the 
I stop-smoking  campaign  were  those  who  started  smoking  only  re- 
cently and  who  arc  at  the  same  time  particularly  health  conscious. 
Let  us  assume  also  that  it  is  only  their  lengthened  life  span  which 
would  cause  the  average  life  span  of  the  exposed  group  to  in- 
crease. Even  this  extreme  self-selection  would  in  no  way  affect 
the  validity  of  the  proposition  that  reduced  smoking  increases 
life  expectancy.  The  reason  wc  would  not  be  concerned  with  this 
self-selection  is  that  wc  know  that  in  the  control  group,  too,  there 
would  be  these  health-conscious  smokers  who  only  recently  began 
to  smoke,  in  about  the  same  proportion  as  thc-y  occurred  in  the 
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experimental  group.  And  we  should  not  care  if  some  of  the  con- 
tro1  group  stopped  smoking  on  their  ' wn,  without  exposure  to 
the  propaganda.  For  it  is  quite  sufficiei..  to  know  that,  whatever 
the  control  group  does,  the  experimental  group  also  does,  in  addi- 
tion to  ihe  experimental  group’s  reaction  to  the  stop  smoking  cam- 
paign. 

Now  instead  of  two  groups  of  smokers,  let  us  lake  two  groups 
of  trials,  one  conducted  by  Judge  A’  who  separates  some  of  his 
tiials,  and  one  by  Judge  Y who  separates  none.  Since  the  court 
assigns  cases  at  random,  the  two  groups  of  trials  will  be  com- 
parable. The  only  differences  between  the  groups  will  be  that 
there  was  separation  in  one  but  not  in  the  other,  and  conceivably 
the-  differing  expeditiousness  of  Judges  A’  and  F.  If  the  latter 
factor  is  discounted,  and  if  the  average  trial  length  in  the  group 
of  cases  where  separation  sometimes  occurred  is  below  that  of 
the  cases  where  it  was  not  available,  we  are  justified  in  ascribing 
this  difference  to  the  separation.  One  would  then  expect  Judges 
K and  L,  who  held  no  separate  trials,  to  show  a higher  average 
length  of  all  their  trials  than  Judge  A,  who  separated  in  S9  per 
cent  of  his  trials.  In  reality,  of  course,  there  arc  not  two  judges, 
but  several  judges,  each  having  a different  propensity  to  separate. 
We  should,  therefore,  expect  this  proposition  to  hold  true:  the 
larger  the  proportion  of  separated  trials  for  the  particular  judge, 


TABLE  4 

Separation  Ratio  and  Average  Trial  Time 
in  Personal  Injury  Jury  Trials 


Judf?e 

Proportion  of  C»*c* 
'tried  f.’ndcr  Rule  n 
(Per  Cent) 

Aveme  Length  of 
AH  Trl.il*  Before 
ThI*  Jud;e 
(Day.) 

Number  of 
Trialj  Drfore 
Thu  Judge 

,0 

89 

3-2 

~ (76) 

p 

51 

3-3 

(41) 

c 

SO 

*•3 

(10) 

D 

38 

3-5 

(26) 

R 

29 

6.i 

(7) 

F 

20 

5-6 

(S> 

G 

17 

4.2 

(6) 

II 

14 

3-8 

(.a) 

l 

7 

3-9 

(37) 

J 

7 

4-3 

(14) 

K 

0 

4-5 

(1) 

L 

0 

4-5 

(1) 
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the  .shorter  the*  average  length  of  oil  his  (rials,  In  onr  case,  we 
should  expect  Judges  .1  and  ft,  who  have  lire  highest  separation 
rates,  to  show  lower  average  trial  lengths  for  all  their  (rials  Ilian 
Judges  /,  ),  K,  and  /„.  Table  presents  the  pertinent  statistics. 
It  is  as  we  expected:  as  the  separation  ratio  increases,  the  average 
trial  length  decreases.  The  e.v.tcl  nature  of  this  relationship  be- 
comes even  clearer  if  we  put  Table  4 into  graph  form  as  in  Table  5. 


TADLE  5 


Pir  Ci:nt  Summation-  and  Trial  Time 


Ter  Cent  $<j  irctien 

O Judge*  with  1 1 out  ot  ttiwr  * 
Q Judges  with  more  \h»n  14  fists* 

* In  ifi  injury  jury  I* bis  wlxllitt  or  toi. 

1 if*  by  klicr  u ikv  h l>bk  4. 


Generally.  Tabic  5 shows  a band  of  dots  sloping  downward 
toward  the  right-hand  borderline  and  thus  confirms  out  expecta- 
tion that  the  average  trial  length  decreases  as  the  proportion  of 
separate  trials  per  judge  increases.  As  will  be  noted,  the  individual 
joints  representing  the  various  judges  are  far  from  forming  one 
line  which  would  represent  the  relationship  between  separation 
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and  trial  length;  they  show  great  individual  difference*.  These 
differences  derive  from  two  causes.  First,  from  the  fact  that 
some  judges  will  try  their  cases  more  cxperiitfeusly  than  others; 
if  so,  '.heir  “points’’  will  lie  somewhat  lower  than  they  would  if 
they  tried  their  cases  at  average  speed.  Second,  from  the  fact 
that,  although  cases  arc  assigned  to  the  judges  at  random,  some 
judges  will  he  assigned  more  time-consuming  ca«cs  than  ethers. 
The  distorting  effects  of  both  those  factors  arc  automatically  re* 
duccd  to  the  extent  that  the  number  of  judges  (points)  glows, 
so  that  the  “slower”  and  “faster”  ones  balance  each  other  — pro- 
vided that  there  is  no  consistent  correlation  between  "fast”  judges 
and  their  propensity  to  separate.1*  The  distorting  factors  arc  also 
reduced  to  the  extent  to  which  the  individual  judges  have  more 
cases  and  hence  are  likely  to  have  on  balance  lire  same  average 
assortment  of  long  and  short  cases.  In  line  with  this  reasoning, 
we  wi1’  disregard  all  judges  who  had  less  than  fifteen  cases.  If 
we  then  look  at  the  five  remaining  Judges,  /,  If,  D,  /?,  and  A,  we 
find  them  to  fall  nicely  about  the  straight  line  we  have  drawn 
in  Table  5. 15  The  two  crucial  points  along  this  line  arc  its  begin- 
ning and  its  end.  It  begins  (at  separation  zero)  with  3.S  days  and 
ends  at  the  right-hand  margin  (separation  100  per  cent)  at  3.0 
days.  Thi:  lino,  the  rmich-looked-f1' ' unbiased  estimate,  suggests 
that  if  the  * ourt  had  tried  all  its  personal  injury  jury  eases  under 
the  separation  rule,  it  would  have  saved  o.S  days  out  of  3.8  days, 
or  i t jx'r  cert  of  the  total  trial  time. 

IV 

An  additional  table  provides  further  insights  into  the  means  by 
which  lime  is  saved  through  the  separation  process:  Table  6 
compares  the  stage  and  mode  of  termination  of  tegular  and  sep- 
arated trials.  The  figures  from  Table  i,  telling  us  at  what  stage 
the  cases  terminate,  form  the  right-hand  margin  column  of  Table 
6;  those  from  Table  3,  mode  of  disposition,  form  the  last  lines  for 
regular  and  separated  trials. 
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fttikitftt  natabvf  of  toojvrjr  trI*U  for  euk  )«<!£*,  of  ilMistks  on  Hs 
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TABLE  6 

Stage  and  Mode  or  Deposition  or  Pcrsonal  Injury  Jury  Trials 


VrrJ'cl  for 
D<  fttilM 


f»i;«  of 
T«  rrn'mlloo 


Suited 


r>’rt(»r4 

VtrJitt 


Jar  y 
VtrM 


)\ny  Vrtdfcl 
f t-r  rut&tij 
tft  L'lbrL'tJT 

ird  Dirufc* 


TOTAL 


During  Plaintiff’* 

Case  i $% 

At  End  of  Plaintiff's 
Case 

During  Pefcndan 
Case 

After  Full  Trial 
Total 


During  Liability 
Trial 
At  End  of  Liability 
Trial 

During  Damage 
Trial 

After  Full  Damage 
Trial 

Total 


A.  Regular  Trinlt 


i$% 


3% 

s 

1% 

• • 

4% 

3% 

i% 

31% 

43% 

?s% 

21% 

3% 

“31% 

'"*42% 

*100%' 

B. 

Secreted  Tfiots 

10%  * 

f 

9% 

• • 

»9% 

15% 4 

4% 

61% 

4%4 

4% 

3%  4 

, . 

, . 

t>% 

*5% 

32%  ' 

'“13% 

*“13% 

"“12% 

1C0%* 

* it?  . i ills. 

* £9  trials. 

* Settled  Wore  juiy  r'U-incd  any  \ndict. 

‘Scttkd  after  Jury  \ c:<.’’ct  »f>*tr.i5rj  liability  bvl  Were  jyiy  ittuintd  v tid'd 
en  d-m»fc«.  Tbr-t  case?,  totaling  n prr  ernt  (ij-M  + j),  rcpttscat  those  ta 
v hkh  the  jirry  found  for  the  phintid  w>  Jhbility  but  never  dclib«*trd  on  the 
rjvcsrinn  of 


If  we  look  first  fit  regular  trials,  we  sec  that  iS  per  cent  of  all 
regular  trials  were  terminated  through  settlement  dining  the  plain- 
tiff's ease,  and  that  settlement  at  a.  later  stage  is  much  less  fre- 
quent. Then  we  turn  to  the  figures  for  separated  (rials.  Here  we 
see  confirmed  our  hypothesis  that  the  bulk  of  cases  disposed  of 
at  the  end  of  the  liability  trial  terminate  in  a verdict  denying  lia- 
bility: of  the  6i  per  cent  terminating  at  that  point,  47  or  roughty 
three-fourths,  end  with  verdicts  for  the  defendant.  But  the  un- 
expected clement  concerns  those  cases  in  which  the  jury  finds 
for  the  plaintiff  on  the  issue  of  liability.  We  fu.d  an  increased  will- 
ingness on  the  part  of  defendants  to  settle  once  they  have  lost  the 
liability  issue.  Fifteen  per  cent  of  all  trials  end  through  settle- 
ment after  the  jury  affirms  liability,  another  4 per  cent  are  settled 
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during  (lie  damage  trial,  and  5 per  cent  al  (he  end  of  the  damage 
(rial.  Thus,  of  (he *  *'  j + 4 3 + 12  -)  34  per  cent  of  all 

cases  where  the  jury  finds  for  the  plaintiff,  only  19  per  cent  reach 
a (rial  on  the  damage  issue  and  only  12  per  cent,  or  about  one- 
third,  reach  a second  jury  verdict.  In  short,  separation  may  save 
trial  time,  not  only  in  cases  ending  in  defendants’  verdicts  but  in 
cases  resulting  In  plaintiffs’  verdicts  as  well.  Once  liability  is 
affirmed  these  cases  loo  arc  very  likely  to  be  settled  and,  hence, 
will  benefit  from  separation  of  tlie  issues. 

These  figures  explain  why  it  would  be  so  difficult  to  predict  at 
tlie  time  of  separation  which  cases  ace  likely  to  benefit  from  it. 
A prediction  that  the  case  is  likely  to  end  for  the  defendant  is  not 
enough,  because  the  cases  that  go  for  the  plaintiff  on  liability  also 
are  likely  to  benefit  from  separation.  To  select  effectively,  tlie 
judge  would  have  to  be  able  to  identify  in  advance  the  t;vo  groups 
of  cases  that  will  not  benefit  from  separation.  These  are  (1)  the 
15  per  cent  of  cases  which  go  through  a full  trial  despite  separa- 
tion, ar.d  (2)  those  cases  whose  early  termination,  though  after 
the  liability  trial,  cannot  be  credited  to  separation,  since  they 
would  have  ended  prematurely  through  settlement  or  directed 
verdict  even  if  they  had  been  tried  rcgulatly.  The  fust  two  figures 
in  the  right-hand  column  of  Table  6,  adding  up  to  (iS  + 4 ) 22 

per  cent,  are  a measure  of  the  f requeue)  of  these  cases.  We  now 
appreciate  why  Tables  2 and  5 come  out  so  similarly.  The  "pur- 
poseful" selection  by  the  judge  Is,  al  least  with  respect  to  the 
expected  time  saving,  not  too  different  from  a selection  by  lottery. 

V 

Now  v.c  may  consider  a third  approach  to  the  problem  of  esti- 
mating how  much  time  i<  s.red  by  the  separation  procedure.  It 
Is  dear  that  the  entire  tim*  saving  comes  from  avoiding  the  liti- 
gation of  damages,  either  in  cases  where  liability  is  rejected,  or  in 
cases  which  are  settled  after  liability  is  affirmed.  We  can  then 
roughly  estimate  the  lime  saved  if  we  know  three  items;  The  pro- 
portion of  trials  that  go  their  full  course  to  verdict  if  separation 
is  not  Available,  the  proportion  of  separated  trials  that  go  the  full 
course  to  a second  verdict,  and  the  share  which  litigation  of  dam- 
ages constitutes  of  the  total  litigation  time  in  a full  trial.  We 
know  the  first  percentage  to  be  6:  per  cent;  u from  Table  6 v.e 
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learned  that  only  12  per  cent  ,r‘  of  the  separated  trials  went  to  a 
second  verdict.  This  means  that  separation  cause?  about  (62  — 
is  **)  50  per  cent  of  the  trials  to  terminate  without  running 
their  full  course.  As  to  the  share  of  total  litigation  time  devoted 
to  the  damage  issue,  we  must  rely  on  expert  estimates,  although 
theoretically  one  con'd  produce  an  actual  time  count  on  a repre- 
sentative sample  of  trials.  Conversations  with  judges  and  lawyers 
have  led  us  to  conclude  that  on  the  average  the  damage  issue  con- 
stitutes at  the  most  <0  per  cent  of  the  total  trial  time.  Armed 
with  these  figures,  we  can  perforin  the  following  computation: 
If  each  of  the  roughly  50  per  cent  of  the  cases  in  which  litigation 
of  damages  is  avoided  involves  a saving  of  40  per  cent  of  trial 
time,  then  the  saving  from  separation  would  amount  to  20  per 
cent  of  the  total  trial  time.  This  result,  in  spite  of  the  roughness 
of  its  basis,  is  close  enough  to  our  other  findings  to  permit  now  the 
firm  generalization  that  in  personal  injury  jury  trials  scparolion 
seres  hid  time  of  the  magnitude  of  about  eo  per  cent. 

VI 

An  important  limitation  should  now  be  noted  00  the  amount  of 
lime  which  courts  may  actually  save  through  the  separation  de- 
vice. All  calculations  so  far  are  based  on  the  assumption  that  oil 
rases  would  be  tried  under  the  separation  rule.  Out,  as  Table  1 
indicated,  in  this  court  only  37  per  teat  of  the  personal  injury 
trials  are  separated,  and  hence  the  time  saved  thereby  is  not  20 
per  cent  but  only  57  per  cent  of  this  20  per  cent,  that  is  (.37  X 
’o  «)  7.4  per  cent  of  the  total  trial  time.  We  also  know,  how- 
ever, from  Table  4,  that  this  average  of  37  per  cent  is  not  the 
same  for  all  judges.  We  know  from  conform  with  Judge  d, 
who  had  per  cent  of  his  trials  separated,  that  he  ordered  sep- 
aration as  a matter  of  routine,  unless  cause  was  shown  as  to  why 
separation  should  not  be  ordered.  Judges  / and  7,  on  the  other 
hand,  apparently  proceeded  with  regular  trials  unless  a special 
reason  was  advanced  for  ordering  separation.  We  would  con- 
clude, therefore,  that  to  per  cent  is  a good  enough  estimate  of  the 
proportion  of  cases  which,  under  the  most  liberal  application  of 

"Iri  addition,  cf  ihe  WfvMrJ  tan*  vtrt  retlkd  after  a la!  Irial,  Wt  do 
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the  separation  rule,  would  be  loft  to  the  regular  mode  of  trial. 
And,  consequently,  we  conclude  that  if  the  separation  rule  were 
applied  liberally  the  overall  time  saving  would  not  reach  so  per 
cent  but  only  (.90  X so  ** ) 18  per  cent. 

VII 

With  this  we  reach  the  end  of  our  efforts  to  measure  the  amount 
of  trial  time  saved  through  separation.  The  remainder  of  this 
essay  is  concerned  with  factors  that  could  conceivably  operate  In 
the  opposite  direaion,  increasing  the  court’s  burden  of  trial  time.1* 
The  most  obvious  one  might  br>  that  the  amount  of  time  needed 
for  the  jury’s  deliberation  in  separated  eases  — where  the  jury 
may  deliberate  twice  ■ — may  be  greater  than  in  regular  trials. 
Second,  separation  could  conceivably  reduce  the  number  of  jury 
waivers,  increasing  the  proportion  of  the  more  time-consuming 

u before  proceeding,  \\  L?  netmaty  to  report  on  two  minor  housekeeping  de* 
lath.  One  concerns  the  court*!  u?e  of  Import  hi  medical  expert?;  the  ether,  tvr 
treatment  of  13  cases  hi  v.hkh  ctily  thi  damage  bsue  was  trkd  because  liabi’ity 
was  ilipuhtfd. 

At  the  time  th:  separation  rule  wax  Introduced  the  court  put  lr.ro  effect  * not)  cr 
innovation.  The  judges  In  their  discretion  were  enabled  to  appoint  ?n  Impart  ill 
medial  expert  If  th*  adversary  experts  were  too  hr  apart.  Since  It  could  It  argued 
that  lueh  experts  cou!J  affect  trill  time  and,  hence,  out  analysts,  wc  give  herewith 
the  relevant  d.ita.  They  carry  the  string  sugeesiion  that  these  exp'-ts,  whatever 
their  cfkct,  did  tot  distort  out  analysis  ol  trial  separation.  fourteen  pet  cent  of 
the  serrated  trial?,  and  I per  cent  of  the  regular  trial*,  had  such  expert.*.  The 
MTfape  length  of  tfcctt  trial?  was  r.J  dn*  for  the  separated  ard  5.3  for  the  regular 
Irish.  There  Is  thus  no  significant  division  from  the  general  pattern.  In  any  evert, 
as  we  have  pointed  out  elsewhere,  the  primary  effect  of  the  impattnl  expert  Is 
likely  to  be  Ms  Increasing  the  likelihood  of  settlement  ttjert  trial.  t>;t  xv 

Concerning  the  13  case*  ?\i:h  stipulated  liability,  the  Question  had  to  be  answered 
as  to  *heth*r  these  ? tip;  lit  ion  5 would  have  taken  place  if  the  separation  rule  had 
rot  exktcd.  Defense  counsel  might  hive  simply  preferred  not  to  kt  the  jury  know 
Just  how  r/gKgerl  his  tlkal  hid  been,  for  purposes  of  our  statistics  we  treated 
the  c**es  as  fcTovs;  If  the  juice  separated  the  Issues  we  assumed  that  admission 
of  BiUEty  v.ii  In  response  to  the  scyaiathoft  order.  Out  cf  the  tj  caws  where 
liahiTrty  was  * dn.il  tcJ,  3 feH  Into  this  category.  The  rcmAinirg  to  cues  were 
Counted  an  off  the  ttgul'.r  trills  vr.de  r the  assumption  thit  here  KaVilily  would 
have  been  admitted  even  if  the  case  had  reached  trial  prior  to  the  adoption  of 
the  iepira*k>ft  tuk\  This  treatment  probably  coiiCcdes  k>*  snf.uer ce  \o  the  scpxn* 
taon  rule  thar.  It  deceives.  The  10  cues.  treats  as  rrcuhr  trials  represent  (to  oat 
of  tt)  e)  ^ of  all  rcfiht  In'ik  AUhough  we  hivt  to  strthtks  on  such  casts 
for  the  time  prrr  to  the  telris  adoption,  it  seems  unlikely  — and  the  judges  have 
confirmed  this  In  theft  conferences  with  w«~tha|  this  pete tr.Uge  was  as  high 
as  that.  We  prrb'.SJy  err,  thereiare,  ty  Only  3 cases  cf  admlrtel 

lability  to  the  sepnubn  r.k.  This  ertor  will  teed  to  male  the  average  trial 
tine  foe  rerJn  c.ki  sh*rt>\  and  for  the  sep^attd  case  longer,  than  it  actuary 
Has;  fc  see,  it  pro!  ibly  cat  ts  us  to  ut.d?re*tjnx*t  til  nrirp  attributable  to  fe;* 
a ration. 
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trials  before  juries.11  Third,  separation  might  increase  lire  num- 
ber of  hung  juries,  and  hence  (he  number  of  eases  (hat  have  iO  be 
tried  twice.11  Fourth,  separation  might  reduce  (he  proporlion  of 
cases  settled  prior  to  trial,1*  and  thereby  increase  the  number  of 
cases  that  need  to  be  tried.  The  suspicion  that  separation  of  issues 
may  lead  to  an  overall  slackening  of  trial  speed  has  already  been 
laid  to  rest,  especially  by  Table  i,  which  showed  that  the  separated 
trials  that  go  to  two  verdicts  do  not  last  longer  than  full  regular 
trials.  Wc  turn  now  to  the  investigation  of  the  possible  offsets 
enumerated  above. 

A.  Increased  Time  jor  hay  Deliberation? 

The  first  point  is,  in  any  event,  one  of  relatively  minor  impor- 
tance, since  jury  deliberation  time  is  not  necessarily  court  time. 
Nevertheless,  it  will  be  of  interest  to  see  the  pertinent  figures  in 
the  following  table. 


TABLE  7 

Average  Jury  Dei  ile ration  Time  and  Frequency 


Tin*  x 

r«r  om  d 

7V»U  it 
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Thus  we  see  that  the  jury  in  separated  trials,  If  it  deliberates 
twice,  spends  an  average  of  (3.4  + ».»=)  5.6  hours  in  the  jury 
room  as  against  an  average  of  only  g.j  hours  in  regular  trials.  This 
could  be  an  offset,  except  that  in  only  11  per  cent  of  the  separated 
trials  the  jury  was  called  upon  to  deliberate  twice.*'  The  last  col- 
umn of  the  table  takes  into  actount  the  frequency  of  deliberation, 
giving  an  average  deliberation  time  for  all  eases,  including  those 
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In  which  tlic  jury  deliberated  only  once  or  not  at  all.  With  dc* 
liberation  time  thus  weighted,  the  effect  of  separation  is  favor- 
able rather  than  unfavorable;  the  table  reveals  that  the  jury 
spends  on  the  average  0.37  hours,  or  about  :j  minutes,  less  de- 
liberation time  in  the  average  separated  trial. 

!i.  Decreasing  Frequency  of  Jury  Waivers? 

On  the  second  point,  the  frequency  of  jury  waivers,  the  evi- 
dence is  equally  reassuring.  There  is  no  ground  for  believing 
that  the  proportion  of  demands  for  juries  increases  under  the 
separation  rule;  in  fact  the  evidence  seems  to  point  the  other  way, 
though  the  difference  is  negligible.  During  the  two-year  period 
before  separation,  juries  were  used  in  90  per  cent  of  all  personal 
injury  cases;  51  during  the  corresponding  period  after  separation, 
in  only  S 6 per  cent.*1  Furthermore,  if  separation  were  to  reduce 
jury  waivers,  one  would  expect  the  judges  who  are  more  likely 
to  order  separation  to  show  a lower  incidence  of  jury  waivers. 
Hut  Table  S,  which  presents  the  relevant  data,  shows  no  evidence 
to  this  effect. 


TA1ILK  t> 

Separation  Ratio  and  Jury  Waiver  in 
Personal  Injury  Trims 


Ttt  CtM  of 
Ti>i  Vft4«-r  R:!e  j 1 * 

iet  Oft 

W iittt 1 

Setter  0/ 

TrY*  t'tc'e 

i>U  1 

A 

81 

n 

(33) 

B 

si 

9 

(45) 

C 

45 

9 

Or) 

I) 

16 

<30 

K 

*9 

0 

(7) 

F 

*7 

>7 

(6) 

V 

»4 

»4 

(7) 

n 

tl 

t8 

0?) 

1 

6 

tt 

(33) 

j 

7 

0 

(O) 

K 

0 

0 

0) 

L 

0 

0 

0) 

I Kfotilk*  titio*  difet  from  tSo?*  it  Tibfe  4 bfciaw  ilxy  ire  tiKtl  cn 

**1  triiH  by  this  ju  r cr.ljr  on  bis  jvfy  If'*.'!*. 

C.  Increase  r;r  flung  Juries? 

As  to  the  third  possible  oi: set  against  the  savings  achieved  in 
trial  time,  a possible  increase  in  the  number  of  hung  juries,  they 
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u.o  as  rare  an  event  row  e.s  they  were  prior  to  the  row  rule.5’ 
Ilut  even  without  such  specific  evidence  it  would  seem  unlikely 
that  any  major  offset  could  be  expected  from  this  source.  Even 
if  the  ordinary  3 txr  cent  rate  of  mistrials  bad  been  doubled  under 
the  new  rule  (which  i*  was  net),  the  offsetting  loss  of  trial  time 
would  be  only  about  1.5  per  cent,  since  a hung  jury  is  a total  loss 
only  in  about  half  of  all  instances;  in  the  other  half  the  litigants 
take  the  jury’s  hint  and  settle,  so  that  there  is  no  need  for  a retrial. 

D.  Decrease  in  Settlement  Ratio? 

On  the  fourth  issue,  the  settlement  ratio,  the  evidence  is  given 
in  Table  9,  which  shows  that  the  proportion  of  cases  reaching 
trial  has  not  changed  at  all. 


TABLE  9 

Sei ! lenient  Ratio  in' Jurv  Cases  BrrORr  and  After  Ruff,  it 


T*t*Yt*t  ht>1  Nfurj  IV 0- Yr  it  ferri  WitV 


K Irlhtf 

Ttf  C**l 

N**ntaf 

Th  Ctfti 

Hdore  Tri.il 

lilS 

HI? 

T(,9% 

Reached  Trial 

a * ? 

»J-*56 

216 

15.1% 

Total  JVrioml  Injury  Jury 
Cases  Filed 

*435 

IOO.O^> 

MJJ 

*00  0% 

That  the  separation  rule  is  not  likely  to  affect  the  settlement 
ratio  can  also  be  appreciated  on  a priori  grounds.  The  likelihood 
that  parties  will  settle  is  not  a function  of  the  relative  strengths 
of  their  |>ositions,  but  rather  of  the  clarity  with  which  each  sees 
the  other’s  position,  since  a settlement  atlscs  when  each  party 
can  weigh  the  other’s  chances  and  agree  with  him  on  what  those 
i chances  are.  Whatever  possible  difference  the  rule  could  make 
to  the  verdict  expectation  of  the  litigants,  there  is  no  reason  to 
t believe  that  such  a difference  would  bo  more  visible  to  or.e  side 
than  to  the  other.  Wc  should  expect,  therefore,  that  any  possible 
difference  would  leave  the  litigants  as  likely,  or  as  unlikely,  to 
settle  as  they  had  been  before.*4 
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VIII 

Although  our  Inquiry  has  taken  us  far  afield  and  into  several 
side  forays,  the  task  has  been  a narrow  one  to  measure  the 
effect  of  separation  of  Issues  on  the  court’s  trial  load.  This  then 
fs  a summary  of  our  findings: 

Separation  of  Issues  will  save,  on  the  average,  about  jo  per 
cent  of  the  time  that  would  be  required  if  these  cases  were  tried 
under  traditional  rules.  This  saving  derives  from  the  fact  that 
in  many  cases  separation  makes  the  litigation  of  damages  un- 
necessary. Tills  group  Includes  all  cases  in  which  liability  is  de- 
nied, but  also  ibe  majority  of  cases  In  which  liability  Is  affirmed, 
because  two  out  of  three  of  these  cases  arc  likely  to  be  settled 
without  trial  of  the  damage  issue.  Thcic  is  no  evidence  lint  this 
saving  fs  offset  by  a change  in  the  settlement  ratio  prior  to  trial, 
in  the  frequency  of  jury  waivers,  or  in  the  proportion  of  hung 
juries,  any  one  of  which  factors  — if  affected  ~ could  Inctcase 
the  court’s  trial  load. 

It  is  not  possible  to  sort  out  effectively  in  advance  the  cases  in 
which  separation  would  prove  futile.  Therefore,  if  a court  wants 
to  realize  the  maximum  of  potential  time  saving  through  separa- 
tion, it  should  separate  as  frequently  as  possible.  The  time  saved 
will  be  in  direct  pro<v>ttion  to  the  frequency  of  separations.  Such 
a policy  recommends  itself  also  on  the  ground  that  separation  is 
unlikely  ever  to  edd  substantial  trial  time  even  if  cases  should  go, 
as  some  do,  to  a second  verdict.  If  a court  were  to  follow  Judge 
A'*  example  and  order  separation  routinely  unless  cause  is  shown 
to  the  contrary,  it  could  expect  separation  in  about  qo  per  cent  of  all 
cases.  On  this  level  of  separation  the  overall  saving  could  be  ex- 
pected to  amount  to  roughly  iS  per  cent  of  the  comt's  trial  time. 

Viewed  superficially,  the  route  which  took  "$  to  this  result 
might  seem  overlong.  Yet  such  is  the  nature  of  the  social  sciences 
that,  short  of  a controlled  experiment,  all  their  evidence  is  imper- 
fect; onty  through  a variety  of  converging  approaches  can  one 
hope  to  reach  safe  ground.*5  Hut  from  our  investigation  we  have 
learned  that  separation  Is  a powerful  remedy  for  court  conges- 
tion. Kully  used,  it  would  be  equivalent  to  increasing  the  number 
of  judges  ttying  those  cases  by  one-fifth.  Ca <es  which,  if  not 
separated,  will  require  five  judges  for  their  disposition  will,  if 
separated,  require  or.ly  four  judges  and  free  one  for  other  work. 

The  introduction  of  the  separation  rule,  as  we  had  occasion  to 
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observe,  raised  oilier  questions  beyond  its  efficacy  as  a delay  rem* 
ccly.  These  questions  concern  llie  possible  effect  of  separation 
c>n  the  substance  of  the  jury  verdicts.  By  depriving  the  jury  of 
its  joint  verdict,  it  is  argued,  subtle  influences  operate  to  affect 
the  verdict,  and  should  therefore  be  included  In  an  overall  ap- 
praisal of  the  rule.  We  arc  now  in  the  process  of  collecting  data 
which  might  have  a bearing  on  this  problem;  If  findings  prove 
significant,  they  will  appear  in  a subsequent  report. 
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The  largest  share  of  our  attention  has  been  devoted  to 
quantitative  data  rather  than  qualitative  data.  Our  emphasis 
has  been  due  more  to  the  complexity  of  quantitative  analysis  than 
to  judgments  about  the  value  of  qualitative  versus  quantitative 
materials.  In  fact,  we  feel  that  qualitative  material  is  valuable 
to  tne  social  sciences  and  that  any  discussion  of  social  science 
methodology  would  be  lacking  if  the  student  were  not  made 
aware  of  its  value, 

hi  general,  qualitative  research  is  used  in  two  capacities: 
(1)  it  serves  as  a first  step  toward  quantification  by  discovering 
"unexpected  phenomenon  which  stimulate  a search  for  causa! 
explanations;"  and  (2)  it  offers  the  researcher  a different  kind  of 
approach,  a "substitute  for  statistical  research  making  the  same 
kind  of  statements  but  on  the  basis  of  recording  and  analysis  , . . 
which  takes  place  largely  within  the  mind  of  the  observer. " The 
following  paper  by  Allen  Barton  and  Paul  Lazarslold  Is  an  ex- 
cellent statement  of  the  role  of  qualitative  analysis  in  social 
research  and  reveals  its  great  Importance.  The  great  bulk  of 


man's  behavior,  of  course,  is  still  far  from  being  easily  measur- 
able—if,  indeed,  many  of  its  most  significant  aspects  can  ever  be 
measured  in  the  traditional  sense. 

There  is  one  particular  aspect  of  the  following  selection 
which  is  particularly  worth  noting.  Qualitative  data,  like  quantitative 
data,  can  be  systematically  ordered  and  analysed  even  though  it 
does  not  offer  the  same  precision.  As  Barton  and  Lazarsfeld  point 
out,  the  researcher  must  organize  the  raw  observations  into  a 
descriptive  system.  These  descriptive  systems,  as  the  authors 
indicate,  can  range  from  crude  lists  of  types,  each  defined  indi- 
vidually to  fully  systematic  typologies  in  which  each  type  is  a 
logical  compound  of  a small  number  of  basic  attributes.  What  all 
such  typologies  have  in  common,  irrespective  of  their  complexity, 
is  that  from  them  one  can  derive  a classification  of  the  values, 
habits,  and  attitudes  which  are  important  to  the  explanation  of 
social  behavior.  Thus  Louis  Wirth's  typology  of  the  Jewish 
community,  involving  the  social  roles  of  Mensch,  the  Allrightnick. 
the  Schlemihl,  the  Luflmensh  , the  Ycshiba  Bochar , and  the  Zaddick, 
give  us  as  complete  an  index  as  any  obtainable  of  the  culture  traits 
and  the  culture  pattern  of  the  group. 
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From  Allen  H.  Barton  and  Paul  F.  Lazarsfeld,  "Some 
Functions  of  Qualitative  Analysis  in  Social  Research," 
Frankfurter  B o i t r a g e Zur  Soziologie 
(Band  1,  1955),  pp.  321-361.  (Reprinted  from  Bobbs- 
Merrill  Reprint  Series  in  the  Social  Sciences,  #S-336. ) 
Reprinted  by  permission. 
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Th  c advancc-raem  of  research  procedure  in  locirl  science  cs  elsewhere  depends 
oa  making  explicit  what  researchers  actually  do,  end  s\  stcioatically 
?.r  aly  zing  it  in  the  light  o;  logic  and  of  substantive*  hrovledgc.  Such  a "codi- 
fication” of  procedures  points  out  c!  mgers,  indicate?  neglected  possibilities,  and 
suggests  improvements.  I:  mal.es  possible  the  generalization  of  methe .Jologieal 
knowledge  - its  transfer  from  one  specific  pioject  or  sublet  matter  to  others, 
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fact  h .ive  been  doing  ^ . t'.i  qualitative  materia!.  About  ICO  studies  were  culled 
for  characteristic  examples.  A:'  effort  ww  then  made  to  organise  th esc  caws  in 
order  that  the  most  dtarac.nbpu  types  of  qualitative  work  could  pc  distin- 
guished and  documented.  This  piper  presents  the  resulting  organization. 

live  reader  will  have  no  di fticulty  in  noting  that  this  “guide  through 
qualitative  research"  is  itself  guided  by  proceeding  front  simple  to  ever  more 
complex  procedures.  \vc  begin  with  a discussion  of  the  value  of  simple  obser- 
vations. We  then  proceed  to  those  studies  which  center  on  ordering  and  classi- 
fication, Our  next  group  of  examples  demonstrates  the  various  ways  in  which 
several  variables  arc  interrelated  through  qualitative  analysis.  Next  we  discuss 
cases  where  the  analyst  wants  to  encompass  such  a great  number  of  dimensions 
that  he  cannot  make  them  all  explicit,  but  tries  to  sum  them  up  in  a general 
“patieru,\  This  is  probably  the  point  a:  which  qualitative  research  is  most 


creative,  most  controversial,  and  most  difficult  to  describe.  I:  will  be  seen  that 
we  had  to  use  a special  term  (“matrix  ;oimuIa,,)>  to.  bring  into  relief  tin  * means 
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We  have  a ho  not  dealt  with  the  problem  of  evidence.  Under  what  conditions 
in  the  social  science!  an  r.swriion  is  proved  is  a very  difficult  question,  not 
rest i let c J to  qualitative  r esenreh.  1:  seemed  best  not  to  touch  on  this  issue,  in 
a context  in  which  all  emphasis  teas  placed  on  providing  a pic i mar  of  a hind 
of  work  which  is  usually  considered  so  "private"  that  it  defies  all  systematic 
presentation. 


I.  ANALYSIS  OI  SINGLE  OBSERVATIONS 


When  one  examines  qualitative  reports,  one  of  the  first  types  of  materia! 
v/Mcli  catches  our  attention  Is  the  ^stirprisingob-ervation’Mdhe  the  nets  of  deep- 
sea  explorers,  qualitative  studies  may  pull  up  unexpected  and  striking  things 
for  us  to  gaze  on.  We  find  that  there  arc  people  who  believe  that  they  arc  being 
educated  by  the  unrelated  and  trivia!  information  presented  by  quiz  shows1 * *  • 
Interviews  with  people  deprived  of  their  newspaper  by  a strike  disclose  that 
some  do  not  turn  to  alternative  sources  of  news,  but  to  reading  anything  which 
is  lying  around  the  house;  a major  function  of  newspaper  reading  seems  to  be 
simply  to  fill  in  “gaps”  in  the  daily  routine*.  Observers  of  the  underworld  tell 
u>  that  professional  thieves  constitute  a rather  exclusive  social  group,  with 
exacting  standards  of  membcrdiip  strongly  reminiscent  of  those  of  lawful  pro- 
fessions1. Anthropological  data  of  course  are  fuJJ  of  surprising  oh:  cr;  a dons: 
that  Fddiv.os  lend  their  wises  to  guests  without  any  jealousy,  that  Fiji  blenders 
kill  their  chiefs  when  they  grow  old,  and  *0  on4. 

Those  phenomena  are  of  various  levels:  some  are  individual  beliefs  and  be- 
haviors, some  are  a matter  of  group  standards  and  structures  within  a society, 
some  involve  the  norms  of  a whole  culture.  In  each,  case  the  qualitative  research- 
er has  simply  disclosed  that  $uc!t-anJ-ruJi  a phenomenon  exists.  And  in  one 
way  or  another,  to  be  told  tint  such  things  exist  has  a strong  impact  on  the 
reader,  't  hey  all  have  an  element  of  surprise. 

In  the  next  few  pages  v/c  will  try  to  clarify  what  this  impact  is,  and  what 
research  functions  are  served  by  these  qualitative  ob*ervnthuis  which  simply 
state  the.  existence  of  something  surprising.  We  can  distinguish  at  least  two 
different  u*:s  for  such  observations.  Firs:,  the  e.x-wi.-nce  of  a phenomenon  may 
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raise  problems  - in::  )<,  compel  us  to  look  for  explanations,  to  explore  its 

q iience q to  try  to  Is:  it  into  our  sJteme  of  knowledge.  Second,  we  irny  find 
in  the  qurdk : live  ob>  :rv atm :r;  an  indicator  of  >«>me  gener.:!  variable  which  v e 
'.van:  to  study.  but  cannot  tm:  autre  directly. 

O l/svrz\r:ion<  VhUb  Rune  Problems 

Some  observations  arc  surprising  because  they  conflict  with  our  expectations 
either  common  -semie  or  theoretically  derived.  Other  observations  surpzhc  u« 
by  bringing  to  light  phenomena  which  arc  simply  new  and  unexplained,  which 
challenge  our.  curiosity,  Vet  another  important  type  of  problem-raising  ob- 
servations Is  thi at  winch  Lvir.gs  together  under  a clear  label  a body  of  “familiar** 
experiences  which,  had  not  provioudy  b:en  seen  as  a definite,  g-ener ally  occur- 
ring social  phenomenon  - which  forms  for  the  thst  time,  so  to  speak,  a “social 
object”  to  be  studied. 

In  any  of  these  cases  the  result  U that  a pro!  [cm  L raised.  Our  at  million  i> 
focussed  on  a phenomenon,  and  wc  rue  stimulated.  to  seek  explanations  and 
inquire  into  consequences,  ho  make  such  a problematic  observation  is  to  initiate 
a research  process  which  may  lead  to  significant  advancement  of  our  under- 
standing of  social  phenomena.  (Some  kind  of  observation-  no  doubt  raise  more 
significant  questions  and  lead  to  more  valuable  findings  then  others.  It  wouM 
be  of  great  value  to  develop  ‘hcreerdm;  principled’  which  can  direct  our  aiter- 
tion  to  the  more  signilicaiu  of  the  >vrp.:sin:;  observations;  tin-:  cannot  however 
be  gone  into  here h) 

To  give  con 
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“The  supiTvho.y  coiuro!  vhidi  U s.t  up  by  mar*  vjcnmr.  to  regulate-  and  govern  the 
workers  exercises  little  avahe-my . , 

“They  (the  employee;)  firmly  lubeve  th.it  they  vi'I  no:  be  sethfaao: Ily  remu- 
nerated for  any  additional  work  tb  y produce  over  the  engoy  . . ." 

A II  of  these  facia  were  m conflict  with  what  the  nningement  nr  d the  researchers 
had  expected.  By  following  them  up  the  researchers  were  led  to  their  now  well- 
known  discovery  of  the  importance  of  informal  groups  in  fauna!  organiza- 
tions, and  of  the  barriers  to  communication  which  exist  between  levels  v/khiri 
organizations. 

Communications  research  offers  many  examples  of  surprising  qualitative 
observations.  \Vc  have  already  mentioned  the  discovery  of  unexpected  motives 
- for  listening  to  quiz  programs  and  for  reading  newspapers.  Unexpected 
responses  to  communications  are  another  important  example,  A broadcast 
warning  the  public  to  patronize  X-ray  operators  and  avoid  “quacks”  led 
sonic  listeners  afraid  of  any  X-ray  renumonts,  and  others  doubting  whether 
there  coufd  be  any  X-ray  machines  in  the  hands  of  incompetent  operators1. 
A film  de  signed  co  impress  Americans  with  the  British  war- eft  or  t left  some  mcme 
convinced  than  ever  that  America  was  herring  all  the  burden*.  The  discovery 
of  sum  anomalous  responres  led  to  more  derailed  investigation':  of  the-  coni- 


nnunentions  procc  s,  which,  turner  J up  some  i:.ipu::au:  general  pdnclplco  « for 
cxainnU\  about  the  sued  to  u Luc  u e mersage  to  the  expencnre-vrorld  for  die 
audience,  which  may  be  c;  die  different  from  that  of  the  cnmmtmicamu 

A study  of  bow  prejudiced  people  rupo;  d to  cartoon*  ridT'ding  prejudice 
found  an  unexpected  type  of  response*.  5?o-i ■»  v people  were  icTh.r  shu.-ed  out 
of  then*  pirjuJucs  nor  ins.durd;  t!s:y  simply  did  i.oc  u-,dcr>;an.l  dr; 

cartrens  were  driving  at.  Wlua  this  rc*p* m-r  was  investigate  : i det.vl,  the 
“tier  silmenc  i'v r? ;t: ” of  motivated  Ml>onderst  i^Jlrg  was  revealed  ( : e e 
p.  3 ? ■ hdov). 

Lisuuns  :o  Kate  Smld,\  war-bend  “rir.ratbon  broad placed  remark- 
able emphasis  on  hr:  “shvrevky’b  Comic  .ring  that  other  prof rT 


mat  eut'i- 
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deed”  in  a propaganda-wary  society  - in  this  case,  Race  Snath's  presumed 
strain  and  sacrifice  in  n.ak’ng  the  ! $■  hour  b:ondcasr. 

The  e>  r.mpbcs  so  far  have  involved  u:icvprc:ed  phvnom.vn.t  which  stimulated 
a search  *or  can  a!  explanation*'.  In  other  instance*  the  problem  which  is  raised 
is  in  tl?e  other  di  icction  - an  investigation  of  the  consequences  of  a certain 
phenomenon  is  stimulated.  1 hvs  one  researcher  interested  i;t  problems  c*  the 
profession  noted  an  “obvious”  fact  as  raising  a problem:  while  all  professionals 
meet  a certain  propot tkm  of  failutes,  die  trial  lawyers  as  a group  necessarily 
lose  half  their  cases’.  What  must  be  the  consequences  of  such  a high  rate  cf 
failure  for  these  professionals,  and  hov/  do  they  dcil  with  it?  The  answer*;  to 
these  problems  might  throw  light  on  some  important  problems  of  the  pro* 
fessional  role.  The  ability  to  take  a commonplace  fact  and  see  it  as  raising 
problems  is  important  because  it  can  lead  ultimately  to  such  enlightenment. 

Another  such  observation  was  made  by  Merten  ir»  his  study  of  the  Kate 
Smith  war-bond  marathon.  Iu  the  content  of  the  broadcast  there  was  no  re- 
ference to  the  real  economic  purpose  of  war  bond  buying  a>  an  anti-inflation 
measure.  Merton  saw  this  fact  as  raising  problems  of  consequences:  an  oppor- 
tunity to  educate  a large  and  attentive  audience  in  economic  realities  had  been 
neglected,  and  reliance  bad  been  placed  instead  upon  “large  delusive  state- 
ments” playing  upon  the  anxieties  of  th.o*e  with  loved  ones  overseas.  What 
were  “the  further,  more  i\  mote  but  not  necessarily  less  significant  effects  of 
these  techniques  upon  the  individual  personality  and  the  society?”  “Docs  the 
unehbor.ucd  appeal  to  sentiment  which  displaces  the  information  pertinent  to 
assessing  this  scutunmc  blunt  the  critical  capacities  of  the  ibte;vr<?”: 

An  example  ol  the  forming  ol  cosnr.vinp!  ce  experiences  into  a clearly 
labelled  soiu.il  phenomenon,  and  thereby  creating  a new  object  for  investiga- 
tion, i>  Adler’s  forr.uihvion  o:  t!:e  concept  of  tlie  inteiiorirv  complex’.  hi  very- 
one  at  one  time  or  another  has  experienced  feelings  of  being  Inadequate, 
unworthy,  etc.,  but  until  llv^c  private  *eu>a::  had  been  pointed  out  and 

labelled,  they  could  not  he  investigated  by  social  science.  Other  examples 
which  might  be  called  to  mind  are  Vcble.Vs  formulation  of  the  concept  of  con- 
spicuous cor.'suciption1 * * * 5,  or  Sutherland's  Itbelling  of  certahi  categories  of  bud* 
ness  beli  i\  ior  as  “white  collar  crime"*'.  Without  having  act*: ally  discovered  any 
new  facts,  simply  by  directing  attention  to  Dmiliir  facts  placed  for  the  first 

1 Vtigsi.-r  T)  i.h’ns,  r.sc;..-b  \ i pro^*.  CoItvjiVi.i  l’Tve;*hy,  !>;/•  *i:r  r;r:  c f Sw 
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time  within  a distinctive  category,1  these  investigators  were  able  to  raise  im- 
portant problems  and  initiate  fruitful  stuov. 

The  reader  may  have  noticed  tV.;  some  of  the  examples  of  “sjrprbing  ob- 
servations” cited  are  no  surprise  at  all  to  him.  This  is  as  it  should  be.  The  point 
is  that  one  time  the.,  were  surprising,  and  initiated  further  investigation  which 
has  been  sufficiently  successful  to  render  them  familiar  and  understandable 
today.  Another  problem  is  that  an  observation  r.ny  be  surprising  to  the  par- 
ticular researchers,  while  other  social  scientists  have  known  about  it  all  along. 
Thus  the  Western  Electric  researchers  made  the  “surprising  discovery”  that 
informal  social  organization  existed  among  workers.  Other  sociologists  had 
long  been  aware  of  the  problem  of  informal  groups;  however,  the  special  pre- 
conceptions of  American  industrial  sociology  up  to  that  time  had  kept  it 
unaware  of  this  whole  realm  of  phenomena.  In  a still  more  extreme  case,  a 
finding  may  be  new  only  to  one  particular  researcher:  in  which  ease  it  might 
better  be  termed  “self-cducation”  than  a scientific  dicovery. 

Is  there  anything  which  a researcher  can  do  toward  making  “surprising 
observations”  other  than  to  maintain  an  alert  state  of  mind?  It  may  seem 
contradictory  to  speak  of  giving  instructions  for  nuking  surprising  observa- 
tions, However,  there  are  ways  in  which  one  can  at  lean  increase  the  prob- 
ability of  making  such  observation?.  Sonic  of  these  are  discussed  by  Jahoda, 
Dci.trcli,  and  Cook  under  the  heading,  “The  Analysis  of  ‘ii->ight-Stimul  umg 
Cases’. *M  Strangers  or  newcomers  to  a community  or  a coumry.  it  is  suggested, 
may  be  able  to  pick  out  problematic  facts  which  are  simply  teken  for  granted 
by  those  accustomed  to  the  locale.  ?dargina'  individu:!*,  or  case*  in  transition 
from  one  stage  or  status  to  another,  may  present  much  more  clearly  certain 
problematic  features  of  a personality-type  of  social  system.  Deviants,  extreme 
cases,  or  “pure,  ideal-typical”  cases  may  have  a relatively  high  e;h  ciency  in 
indicating  problematic  1 Acts. 

Obserzs'icy.s  Which  Sent  Ir.JtCAiors 

The  Inst  type  of  surp::.d ng  observations  discussed  were  those  which  were 
anomalous  J unexplained,  which  served  the  function  of  stimulating  a search 
for  explanations.  Another  type  or  qualitative  observations  ne  Jialle  uin 
because  ve  ?s:c  in  them  indie,  tion?  of  some  1:  :ge-'C:.!c  phea::e eno*.  which  v 
cannot  perceive  directly.  T hu>  the  occurrence  of  : :o:i  :;  A ; rotwc  meetings  In. 
the  Ncith  curing  the  C ivil  Y'a : :c  v;s  a?  nr.  indication  that  opp  >:itio:>  to  I i:i* 
colnb  v.vr  :o!i;y  exl  ud;  Mrs  of  shell  or  pottery  found  in  g:e.ees  : a the 
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routes  of  trade  and  cultural  contact  in  the  prehistoric  woiM;  a peculiar  mili- 
tary cusrom  indicates  the  caste-like  nature  of  army  organization;  modes  of 
speech  may  indicate  complex  mental  patterns  or  cultural  emphases. 

Three  situations  can  be  pointed  out  in  which  one  pays  particular  attention 
to  qualitative  indicators.  They  are  distinguished  in  ter  .1$  of  the  kind  of  ob* 
stacle  which  prevents  direct  observation  and  measurement  ot  the  underlying 
variable: 

(1)  Situations  in  which  qualitative  evidence  substitutes  for  otherwise  simple 
statistical  information  relating  to  past  ages  or  inaccessible  countries. 

(2)  Situations  in  whhh  qualitative  evidence  is  used  to  get  at  psychological 

which  are  repressed  or  not  easily  articulated  - attitudes,  motives,  assume 
frames  oi  reference,  etc.  . , 

(3)  Situations  in  which  simple  qualitative  observations  are  used  as  indicators  of  sue 
functioning  of  complex  social  structures  and  organizations,  which  are  dsffcult 
to  subject  to  direct  observation. 

The  underlying  assumption  in  all  these  cases  is  that  a phenomenon  which 
cannot  be  directly  observed  will  nevertheless  leave  traces  which,  properly  inter- 
preted, permit  the  phenomenon  to  be  identified  and  studied.  A great  historical 
movement,  a basic  personality  characteristic,  an  essential  characteristic  of  or- 
ganizational structure,  should  all  leave  their  imprint  on  almost  any  docu- 
mentary material,  accounts  by  observers,  or  even  physical  refuse,  which  they 
leave  behind. 

Examples  of  the  first  class  (qualitative  substitutes  for  unavailable  statistical 
or  descriptive  material)  would  include  the  use  of  newspaper  stones  or  oilcr 
contemporary  records  of  public  demonstrations  as  indications  of  public  opinion 
in  past  times  of  crisis1;  1 razor’s  use  of  advertisements  for  slaves  in  ante-bellum 
Southern  newspapers  to  find  out  about  the  stiucture  of  the  slave  family2;  the 
use  of  reports  of  refugees,  Soviet  press  materials,  and  the  contents  of  Soviet 
literature  to  provide  information  about  life  in  the  Soviet  Union1;  me  use  Oi 
archeological  remains  to  indicate  culture  contacts  oi  religious  beliefs  in  pre- 
historic times. 

Examples  of  the  second  class  (qualitative  indicators  of  psychological  vari- 
able;) include  formal  projective  testing,  the  psychological  analysis  o!  personal 
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documents  or  artistic  works,  the  an.ih  sis  of  item?  of  literature  or  entertainment 
as  presumed  project  ions  of  the  traits  of  their  audiences  and  of  course  the 
analysis  of  qualitative  interview  s or  records  of  participant  observation.  A good 
example  of  the  use  of  indicators  for  a psychological  concept  is  found  in  the 
study  of  ar.ti-prejv.dice  cartoons  mentioned  earlier: 

“In  tracing  the  process  through  whim  these  6S  respondents  arrived  at  their 
misunderstanding,  we  find  our  starting  point  in  the  fact  that  nos:  of  them  Mend- 
fiat  v/i tli  Mr.  Bigott.  (Footnote:)  By  ‘identifi  v.tion*  we  mean  the  mental  process 
through  wlvrh  a subject  assumes  the  role  of  another  person  to  such  an  extent  that 
actions,  cither  verbal  or  behavioral,  directed  toward  the  object  of  identification 
are  experienced  as  directed  toward  the  identifying  perron.  1 vidence  of  identifica- 
tion with  Mr.  Bigott  was  manifested  by  the  subject’s  acting  in  one  or  more  of  the 
following  ways:  (a)  explicitly  affirming  identification,  saying,  fer  example,  ‘I  guess 
I’m  a Mr.  Bigott’;  (b)  consistently  and  openly  sympathizing  with  Mr.  Bigott,  ex- 
pressing  sorrow,  for  example,  thu  Mr.  Bigott  looked  so  weak  and  sick  in  trie 
‘transfusion’  cartoon;  (c)  interpreting  a threat  to  or  criticism  of  Mr.  Bigott  as 
referring  to  himself,  as,  for  example,  becoming  emotionally  upset  by  the  cobweb 
on  Mr.  Bigott1  s head.”* 

A single  co.  creation  reported  by  tlve  authors  of  Deep  South  bears  witness 
to  the  depdi  of  feeling  involved  in  white  attitudes  toward  Negroes  in  this 
deeply  prejudiced  area:  A social  worker  described  a poor-white  family  in  which 
two  girl 

“'are  having  babies  and  are  not  married  . . . That  isn't  the  cr.d  by  any  means  . . . 
Somebody  told  me  that  this  older  girl  was  sleeping  with  the  father . . * After  die 
recitation  of  the  case,  when  the*  serial  worker  was  out  of  the  room,  a v/or.un 
whispered  to  the  interviewer:  ‘Mrs.  Wilson  says  those  ghls  h.r.e  Negro  vj.v;  :co, 
but  Mi  >s  Trent  (the  supervisor)  won’t  let  me  say  anything  about  ina: . . , Is*:*:  roar 
a + Jult*  It  is  significant  to  note  from  this  interview  that  the  incest  situation  vu; 
viewed  with  les  horror  than  the  infraction  of  the  caua  s:x  taboos.” 3 


0:i  the  bads  of  Ids  long  participant  observation,  William  Whyte  was  able 
to  repent  the  following  striking  indicator  of  the  complete  acceptance  of  gam- 
bling in  Corner v! 


“When  a r«;o:h«v  sead>  her  small  child  do  .vi,  to  fiu-  corner  for  a bottle  of  r./fik. 
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“My  Future  Occupation*’.  The  pervasiveness  of  the  insecurity  of  the  children 
of  the  unemployed,  k?  c orsvO-  e c:;ect  on  pi  Arm  j:i^  for  the  future,  was  indie  uM 
among  other  things  hy  doe  very  language  Used.  Children  of  employed  workers 
would  write,  ‘ I will  he  . . d*  or  “I  warn  to  become  . . Children  of  the  un- 
employed tended  to  use  phrase*  like  *T  might  become**  or  “I  would  like  to 
be  , , d*  In  the  same  study  a small  boy  remarked  to  one  of  the  investigators 
that  he  would  like  to  be  an  Indian  chief,  "But  I am  afraid  it  will  be  hard  to 
get  the  Job."1 

The  third  situation  - the  use  of  simple  qualitative  indicators  to  show  the 
attributes  of  complex  social  structures  - is  very  clearly  exemplified  in  Blumcn- 
thal’s  study  of  a small  mining  community.  The  speed  and  inclusiveness  of 
interpersonal  conumiuicadcm  in  the  community  was  indicated 

"by  t!ie  fact  that  should  a death  occur  at  nine  o’clock  in  the  morning  and  the 
Information  no:  read:  a resident  until  late  in  the  auerroon,  I k usual  expression  is, 
‘I  can’t  understand  why  F didn't  hear  tlr.it  sooner’,  and  others  say  to  him,  * v> here 
have  you  been?  Everybody  knew  that  by  noon."3 

At  another  point  Blumenthal  note*  the  existence  of  conflicting  qiulkuivc 
indicators  of  the  social  contact  between  Mineville  and  its  nearest  neighbor,  and 
concludes  that  one  has  the  greater  weight: 

"During  the  heyday  of  Crystal  her  people  and  tlnx'c  of  Miatville  v.vr?  not  so 
well  acquainted  i*  might  be  supposed  . . . Hotly  contested  baseball  games  and  the 
communities  having  celebrated  together  on  the  rnahi  day  of  festivities  for  c vji  - 
Miner’s  Union  I\r.  - were  rot  indications  of  far- rev  king  personal  relation?.  This 
is  shown  by  the  measure  of  s?.:!al  distance  evidenced  by  th:  fact  that  a young  man 


wnosc  reputation  was  si: 
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n that  k > ‘respect.* b 

! y girls 
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can  army  can  be  inferred  from  the  following  qualitative  indication?  drawn 
from  an  interview: 


“V’e  bunked  teg. slept  t:rc;!  ?r,  fvv3ht  together,  told  each  ether  vhme  our 
rneney  vvs  planed  ir.  our  shuts  ...  If  o r an  get*:  !c:rer  from  home  th:  v.  hole 

coup; \r.y  reads  it/* 1 

i he  nui  :oc?  of  7 f.ic  A^icnc.:rj  van  ting  au  i ndicator  for  the  complex 
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n of  me  ‘ h\r r.\y  caste  $y>:v»n’\  puiiitel  m an  i:^;uuoo!iaI'zc*J  synno.ic 


Tr'istcil  me  i s.heteJ  for  off*.:?  ev ndidatc  s* r.c  ! were  hm  dTchar*;*!  frori  the 


A r me  at 


,»  *i  . 


]\;.i  rw*.d:.i!i!.\i  in  their  ;u*v  ant1  very  different  0 ms.* 


Just  as  it  is  ir  >;  '»>sib!e  m move  from  One  caste  to  rmcth  . in  an  ethnic  cvc 
smtuticr,  <o  an  eidbted  man  a^c*a:  to  become  an  officer  ma  t leave  the  Army 
system  before  reentering  in  hi;  r.cw  statu*.  The  continuation  ol  this  niAom  b 
a ccitain  indication  of  th?  conti?u-it;or»  of  the  attitudes  c*I  a “caste  system*' 
in  the  Army1, 

In  i? housing  the  family  structure  found  in  Middletown's  various  classes,  the 
l.yr.Ji  suggest  a y assude  indicator  of  the  position  of  the  hud* and  i:i  the  fatv.ilty: 

"It  may  rot  l e vhol-y  lantauic  t > suruin  dm:  there  re.y  he  some  sign!  trance 
f.u  the  under:  irking  of  loc  .1  r.ari?  1 '.(icn  n the  h erarcl  / of  terms  by 
vhhh  local  vomit;  speak  of  ibdr  hushv.nJ*.  There  is  a definite  .v;cr.t  of  n.  .*  in  h * 
conjugal  relations  as  one  g^cs  vp  in  d.e  social  ;e  !e,  ficsii  hay  o!J  man*  tf.rocgit 
'the  intu'.  'he*  (rnou  frcqi  rnt),  The  r.drw\  fJo!v  h *rjy  hirh.v-T,  tJ  *Ur.  Jorrd. 
The  lhst  four  ere  the  comm:  a t.rm<  ateer.g  th-e  vc;hir.g  <1a<<  fe«ni1U*,  and  ihe  last 
t v,o  *niC«  g bj*ni»s  class  fani*U»d’s 

The  indicators  which  hnc  been  referred  ta  r.ie  of  many  ilhTcrr.t  futms. 
Swi  are  linguistic,  some  arc  •*. n bolie  sc**r  *on  e a. re  th  e imcntary,  some  arc 
p!i>  sice!  oh;e;m  As  nibs:antne  Luv  !;''jc  cf  Ir.^vJstic?,  $ c’  .1  c;g  n»it;  ulor, 
end  technology  arc  replied  to  the  f ;oh!;m,  o :e  n.  ty  cxpcc;  tver  n >**o  sm-IiUo 
ar  J r«  1:  Me  ir,;c r :c*. uro>$of  sum  qualitative  Indicators.  To  whu  c/t.n  t inter* 
prciaTw  of  in dvato-s  a ill  have  to  rciV'in  r.n  m,  a.:i.l  to  vhat  extent  it  can 
be  trade  a s;!:r,  v,  b one  of  the  imp  an  mo  ptoohms  of  qralitulvc*  res-eauh 
tvhiih  are  cannot  rsc.Yipt  to  dreus*  h etc. 
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And  in  sonic  c.i.'os  tlic  losM.iUr  ir.Uvt  civ  »t<?  his  cl.-.*>ilicAt!oi:  system  for 
the  material  under  study.  It  it  this  laat.i  case  which  will  be  pAfiieulA.lv  con* 
sidercJ  here. 

hi  terms  of  their  forrul  smutinv,  the  descriptive  tViUm*  ctcued  bv  in- 
vestigators c?;\  range  from  civile  lisa*  of  *'typcd\  each  defined  individually 
without  cle.tr  logic/.!  relationship  to  the  others,  to  fully  systematic  typologies  in 
which  each,  type  h a logical  compound  of  a sr.nl I number  of  basic  attributes. 
Hevv/ecn  these  end  points  ; re  alt  intermediate  degrees  of  "partial"  syo.em 
Alization,  including  some  Sets  of  types  which  include  in  rheir  definition  virtu- 
ally nil  the  logical  clcmeuu  nccc^ary  to  set  up  a imtIti-dkint.Mtvi.il  '‘auiibme 
spare",  but  in  which  the  logical  .mwdysi*  has  r bc*.n  explicitly  nude.  Ins- 
criptive $>  stunt  may  also  vary  in  terns  of  their  degree  o?  ci>ncrc:cnc»>  or 
generality.  A fully  systematic  typology  r wy  be  based  on  dinuns;onsof  ;■  lughly 
limited,  concrete  nvivie,  while  n pfvlinibt.wy  Classification  can  be  brord  and 
penrrah 

Urdhtii'i m) 

A ebssifivation  vhV*  fall,  rr^aid  the  vusystonaetbeJ  er.J  of  the  continuum 
can  be  celled  a pre!in;irc.ry  one,  sli  ce  it  rrpre«e;.ts  .1:1  csscritb!  hrst  step  toward 
the  ideal  of  a fi  lly  systematic  one.  *lh:  importance  of  \'v,  On  step  from 
completely  micudcoeJ  data.  ioa  prclir.dr  wy  cl.^srhcetion  rv\?:t  never  be  undrr- 
cstiioAtcJ.  tVit!l  the  data  arc  ordered  in  «e:i»e  vv.y*  the  .V'/ym  of  relationships 
cannot  leginfmore  refilled  categories  normally  develop  c ‘ of  the  AtU.vp:  to 
ard)?c  re! uivvid-dps  bet*. /ecu  j rJimb  ary  citego'h. there  i*  a:i  inlet acting 
pioccrs  b.iv  cm  le'Wmcnt  cf  c!a<>if.e -lio.t  ar.u  the  analysis  of  relationships. 

A gwd  pfelimiowy  cb:ss:fnat;o..  i r<k  provide  a vorkaVc  summary  c?  the 
ve.dtli  of  cbmcms  i i the  Ongbrl  d %t.\  a?  d include  - even  i?‘  in  univsttM  .tic 
form  -•  the  bade  ele  ve  ns  rcee-Mty  for  undt\?pw*  g the  situation.  A bad 
preliminary  cbsdfcuifui  is  ere  •vj.ie-i  is  { e>;  in  ebi  .exits  and  sugge^tivene*?. 
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is  Louis  Virthh  suggestive  on  "some  Je vidi  Type*  of  IV is onalityV  \MrtV* 
defines  his  purpose  it  lining  this  u-dtirquc  as  fallows: 

*’*ll \c  sexologist,  ir:  :: Ansfnr.iiag  die  unky/;  or  individual  c vgerlciirc  into  a 
represent  itivc  cf  typ.Vi?  o;.*,  arrive  At  the  sc.vl  type,  vhuK  tonsils  of  a set  of 
attitudes  on  tie  p trt  of  the  p.rson  toward  himself  and  the  group  end  3 correspond- 
ing jet  of  altitude*  c/  the  group  toward  him  . . . T!.o  range  c»f  pemntbty  types in  a 
given  social  group  is  indicative  of  the  culture  of  the:  group.”1 

Wii tlds  gallery  of  “characteristic  and  pictures^  v personalities  thu  arc  met 
with  in  the  average  community”  includes: 

the  ,Vc«sc/.%  a person  of  superior  ceonomi?  status  vho  In  “Achieved  lo  s\kcc<' 
v ithovt  sactif.eing  his  identity  As 

the  **//'jg\,r/cv.  who  *‘in  his  opp  ;;  i.nisrr.,  hi:  thrown  overboard  n «c-st  of  the 
cultural  haggle  of  his  gr« 

the  5r*7eo j:’  h > ho  belies  the  stereotype  cf  the  Jeer  as  “d  e pc*v>;>ff  i u!cn  of  the 
comnicrcii!  spiii:"  by  King  “^cke  shiftle  s And  Kelpie:;,  «et!h;g  mheraU)  ii 
everything  he  undertakes”; 

the  /.* /hnrMrh,  v l,o  moves  easily  from  one  dr‘jeec‘-f‘jJ  project  t>  another,  cvJ 
*ho$?  “only  apparent  titc*ns  of  sulrdster.ee  is  the  air  he  breather*; 
the  )V;o<?\i  />:'  if,  literally  the  nlwuJictl  student  the  > o jr  g rv.an  v>!  '•s:  Lathing 
fives  pu'itipe  irrespective  of  wealth  or  origin; 

thf  th  1 “pious.  pMiiirel.t!  ; ermn  . . . vho;?  e\C.*r,p!tfy  con  Joe*  is  pdnteJ 

to  As  An  C\3  *ip!c  j ^r>A  so  On*. 

‘I  he  puipA«e«'f  presenting  these  type?,  drawn  h.gcly  from  ;Vo  fedkbro  and 
literature  of  the  subject  group,  lie?  L.  the  fact  that 

* th :)  arc  asc  > vp!<*e  as:  in  dec  as  any  a*,  jrc.crt  cvtwnMc  cf  she  cohere  trait.* .. . 
and  the  cult : re  pattern  of  t;  a trevp . . , Together  they  constitute  the  p-fsanal 
r,  eUt  around  t U,*h  the  fabric  of  ih*  cubvr?  cl  tha  pro  :p  is  vovo.  \ detailed 
analysis  of  the  ciacitl  p*r«*.:v:?ity  ty;  es  5e#  any  oJvt *»  arct  orcvhi.ra)  g.ojp  shows 
tl  at  ih:y  d.p-rnd  up  >n  A ‘it  of  KiVi?*  wl  a? t ::  .d.?  in  the  g:c.p  for  thXr  existence 
and  arc  ti  e direct  expressions  cf  {'  : \ the;  cf  th:  group.”1 

St  tiling  front  the??  type?,  therefore*  on  c-n  derive  a clarification  of  the 
viluer,  Labi;*  and  vhkh  Arc  ir.*pou*r.i  to  the  ev-  Lmotl  'n  cf  the 

behavior  cf  the  group. 

In  nr:dl  tit?  same  ve».y,  f;om  folklore  and  Overture  as  > \!1  as  p;r?aae!  oh* 
Serve t:or,  C,  \ right  Mills  diaves  a c dlerv  of  -rv.  I*?i e ecTv;  typ:?d’  ’1  here  are 
types  oc 
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the  * oU  vet  tram*'  jjst  the  t yp  vha  sab  security  in  closely  fdiov  ing 

explklt  instnj.ti-w,  and  strive  for  J.frrtucc  from  those  lvlo.v; 
the  '‘I’ve  :h.:  pour  *:r  nv;  on  hi*  viy  up; 

the  “ticv  entrepreneur”,  vl  o pro:»ptri  as  a fixer  and  go  ba  veen  in  a world  of  huge 
and  comp! k ao.  J organi/rtkr.s  mass  n. impulsion,  and  gircrd  in<eanit>h 

There  Are  types  uf  irucfkcu  -als?,  of  Academic  men\  and  - all  the  vu y down 
at  the  bottom  of  the  white-collar  pyramid  • of  salesgirls:  "the  wolf”,  "the  dar* 
mcr’\  "the  ingenue*’,  “the  social  pretender”  and  so  cuh 

*1  here  is  x seriojs  purpose  in  pin-pointing  these  pktnrcsqve  types: 

"By  iNAtrinirsg  vhite-ccllar  life,  it  h possible  to  learn  sornrihmg  rbnt  wh.n  is 
bcCoinip.*  inare  typically  Wneiuv.n*  t ban  the  frontier  du.act cr  probably  ever 
var.  Vli:i  mua  l:  grasped  is  the  picture  of  Society  as  j great  saltMoom,  in 
mormon  file,  e.n  In;,  fj  crated  bran,  a raw  universe  cf  management  and  mamp.ih* 
tioa.  By  vndefstandirg  these  diverse  vl, I:  collar  v.*orIJ>,  one  can  a!s  > vndeowhj 
batcr  the  shape  and  meaning  cf  modern  society  as  a vho’e,  as  v, ell  as  the  simple 
h>pe;  and  complex  anxkthj  tlrt  grip  all  the  j :op!e  who  are  sweating  it  out  in 
the  ndd lie  of  the  thirtieth  century/** 


The  pcner.il  run  of  prdimirr  y categoric?  will  not  be  as  colorful  and  ridi  in 
suggestions  as  these  just  qvoud,  but  they  v.  ill  be  of  the  seine  formal  nature:  a 
simple  list  of  dis:<e:c  "types”  ‘i  Inis  v o v,  ill  have  li  of  "typ  $”  of  comic  ! oik 
tenders',  types  of  drent-profcrilcrol  relations,  types  of  appeals  in  a ccrtaki 
propaganda  broadcast',  typsa  of  comr.u-wiies,  etc.,  repressing  a yrdiminary 
ordering  of  mat;,: A i mo  a simple  li-t  of  heading*.  As  the  An  dysl?  prc^re.oo, 
eitl  cr  within  thecri^.ml  study  or  in  the  work  of  rcpiic  toon  or  secondary  ana* 
I)  sis,  thc>e  ample  i •*:«  may  be  de<  doped  i;*;o  more  system  rdc  ar.d  more  peaerd 
descrpdvc  systcr  i>. 

Somcv.iut  further  along  ti  e road  to  giv.srcmty  a.tiJ  $)*tc;nii:«tion  are  the 
hint  s of  "types”  found  in  the  great  deal  of  the  speculative  and  theoretic;! 
liter  a tr-c.  *lyp;cil  c.:anple5  hac  arc  Springer’s  six  ’value  types”  - the 
iucorcttc'o,  economic,  aesthetic,  loci  a.  S,  politic;!,  end  tcligioud  * or  von  Mcidj 
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All  l f the  above  mentioned  $-:t>  < f categories  arc  o:  far  greater  generality 
than  those  vhidi  a r $<e  in  the  analysis  of  a single  Cirphiea!  study  of  li anted 
stop?.  They  .-re  the  result  of  attempt  it  genera!  analysis  of  a vide  range  of 
situations.  On  the  other  hind,  in  their  formal  a spec t,  they  arc  similar  m the 
ether  forms  of  preliminary  categories  di«cus*eJ  earlier.  S^mool  than  .veqriic 
un:\  sten.nizcd;  others  include  ir*  their  uclxi’ion  r:  o>t  of  the  t\:v»0iKs  reepaircl 
to  set  up  a Io^caI  structure  of  h.nie  .vuilctcs  from  vhieh  they  could  he 
it  rived,  In. t this  has  not  been  explicitly  dore. 

A special  ki net  of  descriptive  system  tvled;  rr/gh:  Ve  mentioned  u^dcr  this 
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Sy* ! or.cic  7 > 

The  mo>i  lrg!.!y  chedo; :c  form  of  Jeiciiptive  system  ertn  r*.ri>:  in  a 

qualitative  analysis  i>  on:  in  vh’ca  c.-cli  type  i<  explicit!)'  shrived  f*o:n  the 
logical  con.Binexom  of  Basic  attribute?  or  diir.ens'ons  A simple  example  i>  t!ie 
logic;1!  s Kerne  set  up  by  Ricsmn  in  his  study  of  political  pAuklpAiiotiK  By 
examining  a se:  i f concrete  '‘type  cases"  Rir>:n  in  v/as  leJ  to  break  the  concept 
of  participation  into  nvo  Vase  chine  m;  emotional  invcUxnunt  A*ul  comp:* 
tcntCjOr  r.ore  simp1;,*,  "eating1*  anJ  nng‘\  Taking  each  of  these  dements 
as  a simple  dichotomy,  Riesman  obuined  fou*  types  of  relations  to  politics: 
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C l one  h.'.ndVy  v.  1 ;;hcr  the)  person *!ly  tdlcvc  in  ti  e c*ccd  v?  r,.!t  and,  on  the 
other  I*.v :d,  Vy  v nether  they  piAvtiec  d.'scriminvdon  or  not. 


n 

. V.h 

is  t‘.: 

’ sOicnt  com 

deration: 

iC*:;w-vS  ' 

or 

rv> 

r»<‘ 

: conform  v;th 

tr  JivUi  i!i* 

Cv 

i K!i:«\  c:-i 

:crnin$ 

:V:  nerd 

ch!?  *S 

Cl  i 

;!! 

n;.n  to 

CCU.ll 

opiv* 

S?u 

ec!  in  fcf.r! 

S>-V'«V 

c»?  ten.*;. 

this  AV 

.u . 

t ittitu.* 

vi  ar  J 

OVift 

lean! 

iof  v 

*r«  injjpcnc1. 

Mtly,  Pr; 

: , :dic'  1 1 e : 

:dt*As 

r :d 

t <e>.cU!c  vuS 

d'-cri 

* * o 

,v  U 

Knioa.  TV*  i 

rp!i  Af!: 

s cf  6 is 

Sft^rrc 

il»  iV 

n 1 

out  in 

t , rrr  > 

of  a k 

.«>  at *\  v.  here 

Sv  iV:v 

rid  K>  a*. 

* C IV  i t * 

«■ 

..  i 

.>i:  eJ,  ; « 

cm  ho 

le*  n in  t *.  f i 

:u 

K 1 » \* 

* y.  t ; jv.cgy. 

i ■ » 

: : X \ C,  . ;*  >..t 


/ 6 


Allen  H 


Carton  and  Paul  F.  Lazarsfeld 


A T)  P 'a'^v  of  K;!n:!v  Prejudice  and  Dircrir»ina;!on 
ATI  ITL'DF.  DIMi'.NSION 
Non-pfciv.’ucJ  IV*  jui’iifd 


Non  elite  r'.ir.r: ‘.non 
l-l  HWfOit  DIMFNSION 
DVrlrninWo*! 
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A mere  list  of  the  "lollt-labeld*  of  each  type  would  appear  svp;if*c*3l!y  1 : kc 
one  of  t he  preliminary  lists  of  categories;  they  are  f '.in jAir.en («;!!>*  ihftore.it, 
hov ever,  since  tlicy  aie  systematically  derived  from  the  cross-trbuhuion  of 
two  bade  dimension*. 

The  most  elaborate  r>e  of  systematic  typnb-gics  i;  found  in  Tr.kou  Rusond 
recent  vv»rk»h  Parsons  s.ii  forward  fi\c  divhctorno’is  aitiihuie?: 

I*  AfTcvtivIty  - Affective  ruu  ity; 

2 ScIf*orien:.it;on  - (Vilcctivity  orletit.nioa; 

3.  ViuvcrnlUm  - lVf.Icu!arb;n; 

A.  Ascription  - Adnevemcnt; 

5.  Spevihcity  - iTau  wr.es*. 


liy  combi*’ mg  these  in e^p^tern  v.ui.V:  !^/T\.ti0.islvs  been  able  to  cond  uct 
general  categories  for  de-cribinj  social  rtln:ar<,  CuIumI  system?,  arJ  f er* 
Sana  I it)  systems 

The  piWeu  of  co*o  true  ting  systematic  typologies  need  o ily  1 e briefly 
summarize'!  here.  The  sorting  paint  U til:!1  goo  t prtlimiruiy  f*t  of  rr;* 
goriet.  By  examining  them  one  derives  A small  numb;:  of  r.rrnvt.s  *vhhh 
see jti  to  pie*  Me  tne  bads  for  the  distinction  made,  and  $co>  tl .?>;  a.tirlb  i:o> 
up  as  a mv!:\h‘minc:.M:.I  system  (an  Attribute- spa ee").  Thi*  op*.ut;C:i  Ins 
been  teir  :d  tJ  ■:  NubitrUinto.V*  of  an  Attribute  space  la  a v: po'o  y.  On  can 
than  cwniir  of  t^-:  1<  ghwlly  j v ‘idle  ment;i-»ro  of  tkehv*  enlSeut. 
7 !in  sci  * Cj  to  locate  the  nigh*?.  set  of  categories  % :tMf.  the  r»  0 t it  o': an 
shoe  > tl  it  ? r.c  cov.hi : *;ie-o  have  l ;er.  ig  ared  (;pp.:r  an  blank  c-:!T - % vi  se 
in  otKr  iwe?  /h<;:  a:,o*v  haw  Vem  tinned  (V  e c.;!g;  -d  c*:-..;*vy  \ hi  «.  ve:!ip 
several  cells;.  Of  enr?  rot  *!r  -ftbe  hah  hy  ivhXu-  ray  I ' 

. • • m •*  i - ■ r " i . 
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f»rir. lysis.  Sub  a recombi  nation  1 ..is  been  turned  a "red'icdon**,  And  is  closely 
relate J to  tile  operation  of  inJo;  formation*.  * 

ti  i:  $rlt$tr*tClioti± 

'i  here  remains  to  be  mentioned  a type  of  operation  which  is  vciy  freqnciH 
in  qualitative  analyses:  tbe  paidai  syster.v.ti/.ation  of  a concept  or  a set  of 
categories.  A p.ood  introduction  t-»  tills  operation  is  t!ie  well-known  ds'aivmn 
by  Simnicl  of  envy  ar.d  jealousy**  *ll.e  situation*  in  v*V:di  these  feelings  «ui<c 
art  qvitc  complex,  and  Simmel  doe?  not  give  nn  exhaustive  account  of  them. 
\vIku  be  does,  however,  is  to  indicate  one  important  A<pi;t  it:  vbicu  the  two 
attitudes  differ:  ir.  she  case  cf  jealousy  the  p.t;un  fee’*  that  be  hat  a cl.vni  on 
llic  object  of  Ids  d.slre,  v Idle  in  toe  cw  of  envy  he  b.'S  no  chut  i,  only  dedro 
fo;  the  object,  Sircnul  has  dm*  partially  sib:ti  c.ted  the  attribute-space  by 
wbiJi  envy  v«d  jeeloiuy  could  be  systematically  ikfncd;  be  l.a;  not  dorse  so 
cornpb:e!y,  but  ruber  only  incurs  tonnhe  one  major  distinction. 

A more  elaborate  but  s:i?!  partied  substruction  i>  puserted  in  Werner  1 w.b 
c Act's  discussion  of  **Yy p.>  vf  Integration  an  IThrr  Mcvurimem'b  l.  in  deck  a 
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and  shill*  in  i*o  pre^veloe  [b]  'I  here  is  no  of  vjA  ar.d  play,  or  work  And 
culture.  [6]  Tee  cs'i.'.  rw.h  way  or  !ivebhooJ  deicrioine;  .viJ  i ‘i  >.  i his  entire 
mode  of  l;vir;.Ml 

In  effect,  Mills  proposes  six  attributes  by  which  a job  situation  can  be 
describe:!*  If  .ill  six  of  these  .ui:ihiKC«  have  the  valves  indicated  above,  ve 
have  the  ided-typo  situation  of  "craftm.unship",  The  mi  nation  of  the  modern 
industrial  or  cfd.ce  worker,  Mills  implies,  is  the  opposite  of  the  idealized  craiU- 
* man  in  all  the*?  respects*  Actually  the  six  AU;»butt<  gUv  6*!  logically  passible 
combinations  of  value;:  the  intermediate,  mixed  co;nl  itntiom  hovewr  da  not 
enter  into  Mills'  present  discussion,  which  deals  only  with  the  iJealtyp?  eases 
and  not  with  the  whcle  .^tribute-space. 

In  this  Ci?e  cad;  of  the  six  attributes  actually  refers  to  .1  relationship  ~ be* 
tvecvii  a worker's  capacities  end  his  work,  between  work  aiu!  leisure,  etc.  Viuy 
can  be  derived  front  a rJatio.ul  matrix  consisting  of  four  e!cr»c. .ts:  the  worker 
(his  capacities,  his  character/;  the  v. ode  activity;  the  iinal  product;  and  the 
worker's  leisure  0 ■; 1 1 \ ilics  (his  "play",  "culture”,  "general  mode  o\  living”)* 
l adi  of  thc;e  can  act  on  ary  of  the  ethers,  as  srmiirfoed  in  the  relational 
scheme  bdov: 
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puked  out  a. t \ .1  put  ir.t  a .1  utm  ■!  i heme;  ;ki*  >:r.u.c.;ie 
svhldi  are  irnpOiUnt,  which  v:i:!  uhlm::  eh  help  ;<•  sc*! 
wc  Arc  interested.  !:or*na!  an.tiyds  c • n then  be  u«i\! 
conirnunic.vu-  the  result;  of  qualitative  itiskthts. 


act  if  to  "fee i ou;*’  those 
\c  the  problem*  in  vhidi 

tv  clarity*  oc v c!o;\  and 


ill  QL.MITATIV1  h AT  A SUOGlsVINO  kl‘.  1 ATI  ONS  JI I PS 


The  only  fully  Adequm ; vay  m test  the  c.%istcr:cv  of  a relationship  between 
two  vainblcs  is  through  statistical  analysis;  to  test  eause-Ard-eiTcct  relations 
icqut.es  either  a controlled  experhuent,  or  a rather  large  rami*  t of  croei  of 
"natural  vhanpe"  observed  over  time*  But  reseeuv,  which  h'.?  neither  statistical 
weight  nor  c\}  e hnenm!  design,  rcsearc h bwed  only  on  qualitative  descrif : ior*S 
of  a small  number  of  cases,  cm  tu'.:thc!e><  play  the  import mt  role  of  suggest- 
ing possible  relationship,  causes,  effects,  and  even  dyu.'nic  p;occ?*es.  Imbed, 
it  can  he  a.rp  -d  that  only  research  which  provide*  A we;d;h  of  mbcclhnoov.s, 
unplanned  impressions  and  observations  c*n  play  this  role,  Those  v. ho  tr>  to 
pet  suggestion*  for  possible  ex- la*  atory  factors  for  statistical  results  s;!c!y 
f om  looking  At  tabul atians  of  the  te  a variables  vlilc-h  vcredelihcratvly  huh. dud 
iri  the  study  in  advance  often  cm  make  no  p:  ogress;  fan, :t:n;c*  ever,  a sirjle 
tcriuer.dn  conn. cm  by  a.  respondent  v ill  pRvid?  A clue  to  addino;*.*!  factor*. 

/ rfi/r.g  "/hu/o »/* 

A J.tiMc  ease  of  the  uu  of  qualitative  obr.tv.uor:  to  disclose  j ouiHc  factors 
intloencirg  behavior  is  the  Vestern  Electric  study*.  VPlen  the  expe* ir/.e.ital 
p t o ii jv  of  workers  main*  \ine.i  their  hie!*  pr:-h:vt;en  even  when  physical  con- 
ditions were  made  worse  c m a n before  the  e prhncr:;  began,  it  w;s  clear  that 

?acil'*n.  Vx  hat  the  real  factots  veers  was 


!om 


was  effecting  their  preV 


first  suggc.ud  by  informal  conversation*  with  and  cb*;rvatio*  s of  the  experi- 
mental guup,  a :;<l  f:o.:*  then  Ca  the  mu  r:«evch  etTert  v.\*?  foeus-ed  on 
qualititivc  i*  ur*k vug  and  cb.-et* a;:e-i  to  diseovtr  uva.I  fa. tors  a .d  j ;> 

CC55:  *. 

A fi  ttut  stifclv,  the  r vn  Jo:*  c ;*  vkh  ' as  to  urea  *r  pcndM*  fw!; 
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"the  Ail-uny  sert.s  if  e.ppc  ti>  cnverced  n<  a drcrr.odc  evint ...  a single  imifvJ 
p.uterrdh  "Ytvere  was  icc:procAi  interplay,  for  the  Audience  was  no;  on!/ 
responding  to  Sc.dth.  V j:  she  v.*n  also  responding  to  her  AuJdn.ce  An  I modify- 
ing her  suVseguan:  c ins:. eats  as  a remYh  "there  vaj  considerable  ip.i.t!i:ui\v 
cs  idence  tint  i eidd  i::  Sirdtods  disinterestedness  An  i Altruism  phued  at  integ' a! 
rolo  irj  the  pieces  of  per>.;A>:on*h  "she  ah. lienee**  images  of  Smith.  the  cl.ns 
structure  of  our  society,  t o ecdtanl  standard*  of  distinct  strata  of  the  po* 
pulAtion*  a::J  socially  induced  expectations  feelings,  tensions  were  all  intri- 
cately i:i\o!vcJ  in  the  patterns  of  response  to  the  bond  drive  h '‘but  the 
cumulation  of  afTec t and  emotion  v.i<  t.ot  the  major  function  of  the  rcar.uaon 
broAvIc.; *:$.  Abo*,  e all.  toe  p:  era :neJ  stress  and  strain  o:  the  cightecn-hoyr  series 
of  IvoaJcAjts  ser^iJ  to  validate  Smith**  dnce.it/  . . . for  on  understc.r.uing  of 
the  proves*  ot  p%rstJAsor.,  the  nio*t  d/o ;t;c  on  feature  cl  th.e  e response  to  the 
marathon  is  the  eiTuti wreis  of  this  p;op.\g  auda  of  the  J w!  among  the  very 


p e op ! e who  w e re  d ; < : r a s t f c 


ssepti.r.i  o!  were  cxrotttioti. 


A study  Applying  the  ?A:ne  tcciirdcpie  ;■>  i i>roA  i ; hdto.icil  situation  w,u 
tint  Oi  Idicuvt-i  Xerr.er  on  the  factor*  in  recent  h;*icry  vddeh  ir.fueuced 
attitudes  toward  Jvves  in  1 r.mcch  By  a very  5*nt!l  rend  er  of  detailed  inter- 
v.cte*  v»i;j  people  v !.e  were  pre;c*:icd  10  be  go  v oorervet*  (aIkh  t nos 
cotuicigo  of  Apartment  bou:w.  the  orh.r  ! df  iiOidiertcaYb  it  vas  sag  gened 


that  these  v ;rc  loir  :>\dn  induecc;:.*:  OtitcJrs 

nt>»\er  or  ar.oth.r;  the  j er-c. atior.s  cat-idc  FrAi.ee.  *h 
aware  of  the  J.w*  a?  a -pecial  .roup;  :h.-  a;  -pea  ranee of 
other  cor*  tr:c:  * ho  were  a c!e>r!y  vtd.hle.ddtTeoint  gee-up 
persecud  n.  cf  d l J;  Jews  Ju-irg  the  i ccup.v*  rn,  ‘ 
S^eliup-  c f guilt  e\.;  a certch*.  teal  voc.ger  ? *r  the  e vho 
te?toret!  w of  Jev.bh  ; v?  rr*.J  property  a Ac-  the  td.be:.. 
Civ  v.i  ! :: : er  1 disturb;.:  cv  tv’  t'-  ne  j. >•.-!.  >>. 

One  5p;c?\l  tch  for  c.*:ovc:h-g  ejd  ti*  r ! -h  ;* 
ts  pe  Os  le!  *v*;f  U d : cx*:.  h .den  of  cecc*  %r;’.  d * i 
tx/A.;:d  h’  cf  V.n y : w;c.:.  Y.ve?  a pv:d* 
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r< lie* -i t i !c ’ * nuM;]!i!p.  To  o.p-n;:!  there  cases,  quantitive  hucivk-vs  were  used.  It 

appeared  tlut  such  incr-cs  were  involved  us  the  decree  of  security  in  one’s 
attitude*,  the  feeliru:  that  ou’s  beliefs  were  socially  earned  and  not  a personal 
responsibility,  the  tact  that  the  subject  totally  disidentiflod  Iiim^If  from  t Jic 
caricatured  figure  of  “Mr,  ]T;ott”  and  therefore  were  under  no  threat1. 


Q.7.?//Mr/:r  Sj<££C>:/o/:s  0/  /Voces* 

The  simplest  form  of  a “process”  analysis  is  that  whhh  looks  for  an  intor- 
vcniin;  variable  which  “explains**  the  correlation  between  two  other  variables. 
In  his  study  of  an  Ka$t-co:sc  slum  peighlwhond,  William  Whyte  atnvui 
(on  a qualitative  bash)  at  the  following  relationdjip:  the  socially  aspiring 
“college  boys'*  clubs  seemed  to  be  more  unstable  and  subject  to  internal  con- 
flict than  those  0!  the  sione-iuulflle  "corner-boys’*.  To  explain  this  relationship 
(which  could  be  considered  quite  "upside-down’1  from  a middle-class  view- 
point) Whyte  introduced  a third  variable,  “minimal  orL'ani/ation’1.  The  cor- 
ner boy  clubs  could  draw  lor  cohesion  on  already  existing  inform  tl  organi- 
zation : 


“The  duly 
me rr.be is  .in  .1 


activirres  »;f  the  -lorner  Iv  ys  dcur.vVcd  the  relative  positions  et 
lloeaua  rciprudl  :!id;j  and  oML;-.:ia:;s  wbhia  : b e ere.p.” 
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“The  pattern  of  soritl  rvhiiny  in  Cornerville  c an  best  be  understood  when  it  is 
contr.^tcd  with  the  patrerr*  of  corner-boy  activity.  One  of  th:  rnoit  import a ;it 
divergences  arises  in  matters  involving  the  expenditure  of  money.  Tlu  college  bo;,  s 
Jit  in  with  an  economy  of  savings  and  investment.  The  corner  boys  fit  in  '.wth  a 
spending  economy.  The  college  bey  mu^t  save  his  money  in  order  to  finance  his 
education  and  laur.di  his  bufinessor  professional  career.  He  therefore  cultivates  the 
middle-class  virtue  of  thrill.  In  order  to  participate  in  group  activities,  the  corner 
boy  must  share  h'A  money  with  others.  If  he  has  money  and  his  friend  does  not,  he 
is  expected  to  do  the  spending  for  both  of  them  . . . Prestige  and  inslucncc  depend 
in  part  upon  free  spending/'1 


This  observation  indicate*  someof  the  factors  in  the  process  of  social  mobility. 
Of  course  behind  each  such  factor  uncovered  arc  other  factors  - the  variables 
which  for  instance  determine  who  s:hk>  with  the  boys  and  spends,  and  who 
breaks  away,  saves  money,  and  rises. 

J he  ui. covering  of  possible  processes  can  go  much  further  than  inserting 
a third  variable  in  a chain.  The  study  of  anti-prejudice  cartoons  mentioned 
previously  suggested  a whole  chain  which  led  up  to  misutulc. standing  among 
the  non-deviant  two  thirds  of  the  prejudiced  people  who  misunderstood: 

1.  Identification  with  Mr.  Bigot t and  momentary  understandings 

2.  Desire  for  escape  from  idciidiiration; 

3.  Difidet.tificattort  mechanism  (caricaturing  Mr.  Ifigotc,  nr.klr.g  him  intellect  j illy 
or  social!)  inferior); 

A.  Derailment  of  updcistiadirgs  nlso.ouon  in  the  derogatory  characteristics  of 
Mr.  Ihgott  to  the  exclusion  cf  understand/ag  the  point  of  the  cartoons5. 


In  the  Wnyte  study  one  finch  the  process  of  the  iTo  of  a local  comet  bov  to 
political  leadership  traced  out  through  a scrie.*  of  steps,  with  inter  acting  forces 
noted’.  To  get  a start,  the  corner  boy  in  list  demonstrate  Ids  loyalty  and  ability 
to  got  results  for  his  Immediate  circle  of  friends.  Yet  “if  he  concent,  a:e>  on 
serving  his  ov. n group,  he  veil!  never  win  widespread  support/1  “In  order  to 
win  support  he  must  deal  with  important  people  who  influence  ocher  group?/’ 
Since  he  has  only  htouo.d  resources  in  terms  cf  energy  arid  access  to  official 
favors,  he  must  c bxr.ty”  his  or.guvd  fi  renews  t>y  u.gi.etu.c  the;;  \ me  rests  and 
using  hi?  resources  to  rulp  outsiders  and  lug-shots.  The  result  is  a wfih  -pread 

erg  tne  ran*,  and  lne,  wmch  iiV.ght  oe  expected 
/j.ytv,  the  politician  is 
fohower s in  ;:.e  dbt.'L  t 


ever,  acco ten: 


to 


cynic. sm toward “pokfwan;  a; 
to  cause  constant  turn  ate;.  II 
norm. illy  able  to  “trx.  Je  do  a n*’  enough  benefits  to  hi 
as  p.  whole  to  prevent  a revolt,  even  though  his  closes: 
the  higher  .vpec:  Jt:c:r»  mat  b;  badly  di filxomned. 


c>:  t\  .g:na:  : . 

j ^ h... 


v.o  ti ' a 

. - i 


*/ 


! 


Allen  H.  Barton  and  Paul  F.  Lazarsfeld 

of  equilibria::!,  presumably  ut  a leva!  determined  by  the  political  abilities, 
initial  “connections”,  and  good  lu;k  of  the  individual  politician. 

In  exploring  for  possible  factors  affecting  some  give::  variable,  or  for  dia?ns 
of  causes  ar.J  effects  constituting  a “process”,  there  appear  to  be  two  basic 
techniques.  The  first  attempt*  to  obtain  objective  infornution  about  the  se- 
quence‘of  events,  particularly  what  events  preceded  the  response  under  in- 
vestigation1. The  typical  questions,  whether  addressee!  to  a subject  or  used  by 
an  observer  to  guide  his  observations,  arc:  “What  happened  before  X?  \vlut 
happened  just  before  the  subject  made  his  decision  to  move,  vote,  buy,  steal, 
etc.?  What  was  the  frame  of  mind?  What  had  been  going  on  in  the  family,  neigh- 
borhood, nation,  world?  Had  he  been  talking  with  anyone,  reading  anything, 
listening  to  anything?”  Some  responses  will  look  like  causal  factors  imme- 
diately, on  the  basis  of  our  pa>i  experience  or  gene  ra!  hypotheses  about  human 
behavior.  Others  v. Ill  only  become  prominent  when  we  notice  an  apparent 
correlation  between  them  and  the  criterion  L-havior  in  several  cases. 

'Hie  second  technique  is  to  ask  people  themselves  to  explain  vrhat  happened 
and  to  give  their  reasons  for  acting  a>  they  did-.  The  basic  question  here  is 
always  “Why?”  This  technique  has  obvious  limitations:  people  arc  often 
unaware  of  the  real  motives,  of  indiiect  influences,  of  the  precise  chain  of  causes 


and  effects,  of  underlying  ncce>>u:y  conditions.  On  th/  other  hand  it  stands  to 
reason  that  th : participant  knot  s a good  dee!  about  his  own  behavior,  partic- 
ularly about  attitude.^,  motives,  influence?,  “trigger  events”,  and  so  on,  end 
often  can  tell  the  outdde  invesdg.uor  a bout  things  which  he  would  never  have 
guested  by  himself.  “Heavens”  may  not  be  the  whole  story,  but  the;,-  arc  an 
impOiiant  sou: cc  of  information  on  possible  factors,  and  in  some  cases  a quite 
indispcnsible  source,  especially  in  the  c.uTy  stages  of  investigation.  P»y  adding 
to  th?  general  “why”  quay  a set  of  more  specific  questions,  focussing  the 
respondent’s  attention  on  each  of  severe l bade  aspects  of  the  sltuuion,  reason 
questions  can  obtain  more  adequate  coverage,  although  still  limited  to  what  the 
responder;;  himmlf  is  in  a position  to  know. 


Both  of  these  tedunques  are  combined  »n  a te 
plorvdou.  of  cans  d re!. .dons  known  as  “di  •cerniri- 
da:r‘b:d  in  Mirra  KomarovjVy's  study  of  the  c-tTe 
family  status  of  the  hubarsd Wid;  c \y  ca-e-s: 
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posable  to  unwreake  a full-?,  cam  s:a:i>tical  analysis  of  the  interrelations  be- 
tVrCva  nl ! the  pocdblc  v r- :! aL>Ic Whamvaj  done  was  totakv  each  case  of  apparent 
change  due  to  the  husbandh  unei  plnymoiU  and  subject  it  to  systematic  check;: 
Ilad  the  Huri-o  already  begun  before  die  u:ve;nploynvcnt?  Did  ocher  factors 
arise  concurrently  with  unemployment  which  might  have  been  the  real  cau>e? 
Arc  tire  participants  able  to  trace  the  step-by-step  development  of  the  change, 
the  detailed  links  between  unemployment  and  the  altered  role  of  the  luobnud? 
If  the  respondents  bclivw  that  unemployment  was  the  icw  on  tor  a certain 
change,  on  what  evidence  do  they  base  their  opinion?  by  these  techniques  ic  was 
possible  to  imbue  with  considerable  promise  of  validity  the  causal  relations 
between  unemployment  and  family  structure.  The  search  for  "possible  facm:V 
and  “possible  consequences"  was  made  systematic;  vdthi:i  the  limitations  of  th' 
data,  real  precautions  were  taken  against  spurious  relations, 

Q:i,::i-St.i:isncs 

Previous  sections  have  dealt  v.dth  operation:-  of  qualitative  analy.v’s  winch  are 
tssenti  \lly  prior  to  quantitative  research;  observations  which  rabe  problems, 
tint  formulation  of  descriptive  c itegrvles  the  uncovering  o:  possible  civsal 
factors  or  chtk  s of  causation  foi  a partiauhir  piece  of  behavior.  These  oper- 
ations uimuhu:  and  focus  t quantitative  research,  and  they  i c up  the 
dim,  unions  and  c ttegoru  • alum;  the  “stub**  o:  the  tabby  bito  v hhb  quan  itativc 
research  may  i.ll  the  actual  frequmtde*  aj  b :aeai-v:\ mem  s. 
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as  a continuing  and  increasingly  retinal  piu-’aic  by  the  whole  community 

of  socLi!  sc  Ionises. 

An  cnamr de  i f lac  d:m.y:>  of  irnpressionUti:  “qmsI-stAUsvcs”  is  given  t*y 
Bernard  Huh.,  i:i  :n  artkL  rm  pm ticipanou  in  voluntary  ac>o*iat tons: 

“American  ob;crver>  theiwulv- > were  ever*. dudmed  By  whit  they  did  not  fuUy 
urukutmid:  inuruue  die  MV-v.ing  from  Charks  and  Mary  Beard’s  The  H:sc  of 
Ancric.it : Civil::  . in*. >;•  ,The  Tendency  of  Arable  a ns  to  unite  with  their  felhws 
for  varied  purposes  . . . now  became  a gem — ru.inui  ...  It  was  a rare  American 
who  was  not  a member  of  four  or  five  socicov;*  . . . Any  c i t i r* en  who  refused  to 
affiliate  with  one  or  mure  asmei,. turns  became  an  object  of  curiosity,  if  rot  sus- 
picion*. Although  in  comp  u.vrvf  perspective  the  United  States  may  well  he  a 
‘nation  of  joiners*,  a scivey  of  the  available  due.  on  the  number  o:  people  with 
memberships  in  voluntary  a>soci:do:is  rweal;  die  little-known  fact  that  many 
have  not  even  a single  such  affiliation.  This  lu.uormity  too  holds  i or  all  typej  o* 
areas  in  1 1 c Lifted  States,  whether  urban,  suburban,  smell  city,  small  town  ol 
rural. M| 

Barber  then  quotes  statistic  showing  that  in  these  various  areas  ami  strata 
of  the  population  from  one  third  to  over  tv’o  thirds  w the  people  do  not 
belong  to  any  voluntary  associations.  As  on:  proceeds  from  simple  frequency 
distribution?  to  conehutons  and  thou  to  systems  of  dcnmnir  relationships 
between  several  variable?,  impretsionistic  4k]uad-s:atis:Iv5M  become  steadily 
less  adequate. 
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cviUu ;■  i within  a civih/.ed  society.  IhesO  iiK'tlioJj  seem  to  iin\c  sue c c c e d m 
pi  canting  a good  first  approximation  .it  lease  in  the  description  oi  the  culture 
and  behavior  of  such  group.;.  When  aiuhropobogkts  now  call  lor  iornu! 
sampling.  diti’ rt'cor^K.j',  and  statistical  analysis  it  is  eithei  to  c-ten  up  fiiKt 
details  --the  small  number  of  deviant  individuals,  for  instance  - or  to  deal  with 
situations  of  culture  group;  with  less  homogeneity  - with  groups  hi  process  of 
acculturation,  breakdown  of  old  norms,  or  tne  development  of  stiong  mteuial 
differentiation. 

In  situations  of  less  homogeneity  and  simplicity,  it  is  doubtful  that  quad- 
statistics  are  anything  like  a full  substitute  for  actual  statistics.  However  they 
can  still  play  an  important  "exploratory”  function.  Statistical  research  is  too 
expensive  and  time-consuming  to  be  applied  on  all  fronts  at  once;  like  the 
20 C -inch  telescope  ic  must  focus  on  a few  areas  of  particular  interest  for  inten- 
sive study.  Quasi-statUt’cal  studies  can  run  ahead  of  the  more  cumbersome 
quantitative  procedures  to  cover  w ide  areas  of  social  phenomena,  and  to  prme 
into  tangled  complexes  of  relationship  in  search  of  possible  “processed*.  I hey 
serve  as  a broad  scanner  and  “finder1*  like  the  videangled  but  les*  po \\ eu id 
Schmidt  telescope  of  Mount  Wilson  ant!  Palomar.  Moreover  the  gathering  and 
analysis  of  Mqu.\$i-st.uistiral  data”  can  probably  be  made  more  systematic  than 
it  has  been  in  the  past,  ir  the  logical  structure  of  quantitative  r e > ? a ren  at  least 
is  kept  !:i  mind  to  give  general  warnings  and  directions  to  the  qualitative  oj- 
server. 

Sys  t an  r * ic  Cav;  p . . ri so  k 

There  is  oi-e  special  form  of  tesearch  into. relationship  which  stands  on  the 
border  between  statistical  and  quasi- statistic;.!  methods.  II  his  involves  the 
systematic  companion  of  a relatively  small  number  of  cases.  It  differs  from 
quasi -statistics  in  that  the  cases  are  proceeds  along  lines  closely  approximating 
those  of  a statistical  survey  or  controlled  ex  pci  intent.  I lowever  it  involves  too 
few  casus  to  actual! v e.pplv  statistic. -.1  tc.'ts,  and  it  involves  natural  s ! tti.il: or. * 
in  which  one  cent  rot  be  cutahi  that  "other  factors  are  equal”  to:  the  various 
case*  bevor.d  d o.;e  factors  specifically  analysed.  It  is  a*  though  one  sc:  up  dir 
tables  for  a statistical  or  expeii mental  research,  but  had  only  one  or  two  cm:s 
to  fill  in.  c.vh  cell,  and  pc- haps  had  :o  leave  some  entirely  empty. 

Thii  form  of  “compv.vnivr;  rceerdi”  is  the  o:  ly  p;>s  ;b!e  when  tnc  “casc> 
lobe  studied  arc  *o:hI  phenomena  oi  a buck  order  oi  cm.-plex tty,  wh  as  vv:-. 
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or  an es ltd  civilisations1.  Weber  te.cvd  the  same  suu.veon  in  dealing  with  the 
roleof  religious  sysicmsin  the  development  of  society2,  Beside*  rhe  total  number 
of  available  cases  being  small,  each  is  a very  large  and  complex  unit  which 
requires  groat  time  and  effort  to  analyze.  Even  where  there  are  a large  number 
of  cases,  this  factor  may  compel  the  researcher  to  restrict  himself  to  the  system- 
atic comparison  of  a few,  This  situation  arises  in  studying  communities  or  large 
institutions,  lb  describe  any  one  community's  social  structure  is  such  a large  job 
that  most  studies  have  been  of  single  cases.  Only  allcr  different  researchers  over 
a generation  have  prod  viced  a dozen  or  so  such  studies  can  a ‘'secondary  an- 
alyst0 undertake  a comparative  analysis.  In  the  long  run,  it  is  to  he  hoped  that 
data-gathering  procedures  on  such  complex  “cases’*  can  be  so  simplified  that 
statistical  studies  will  become  possible.  TTten  the  intend vc  study  ol  one  com- 
munity, factory,  union,  government  agency,  or  voluntary  association  can  give 
way  to  a quantitative  study  of  a sample  of  such  cases,  testing  the  hypotheses 
derived  I rom  single-cav  studies.  01  course,  where  the  cbiiiculty  lies  in  the  l act 
that  there  is  only  a handful  of  case  t all,  the  comparative  method  ist  the  best 
we  can  do. 

An  example  of  systematic  comparison  of  a small  number  of  case?  is  offered 
by  Lipsct*s  study  of  the  Canadian  province  of  Sashat  lx  warn  The  population 
of  the  province,  mainly  when  fawner?,  had  a remarkably  high  level  of  parti- 
cipation in  political  nfftirs  and  in  collection  lor  t h\>  unusual  behavior,  com- 
parisons were  first  made  with  areas  where  participation  was  known  to  be  low, 

The  amount  of  participation  in  public  affairs  in  the  luge  cities  of  Canada 
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tci  i to  be  rr.ooop.dieed  by  upper-class  people  ar.d  professionals;  in  Saskatche* 
v;i[\  school  boards,  telephone  companies,  marketing  agencies,  etc.  Ii.ul  to  b: 
stalled  by  ordinary  farmers,  who  thereby  acquired  organizational  and  political 
skills  unknown  to  ti  e average  city  dweller. 

Certain  rural  areas  aUo  are  highly  sn  atiued,  and  in  such  area  , the  rate  of  mass 
participation  is  abo  low: 

"Within  ilu  rural  areas  of  the  Southern  States  or  in  part,  of  California,  where 
significant  social  and  economic  cleavage  exists  within  the  rival  comruvtmt) , the 
wealthier  and  uppcrclas;  farmer,  are  the  formal  cmnnumity  lenlers,  and  the  bu:k 
of  the  poorer  farmer,  are  politically  apathetic/1  (p.  2w-2} 

A third  structural  factor  distinguishing  Saskatchewan  is  its  expomtv  to 
extreme  economic  fluctuations,  due  to  the  unstable  price  of  its  one  main  crop 
ami  to  the  jccutrence  of  drought.  In  t!u>  it  can  be  contrasted  veiui  its  eastern 
neighbor,  Manitoba,  which  lias  more  diversified  crops,  stable  markets,  and  more 
reliable  weather.  And  It  is  notable  that  Manitoba  today  has  niuca  less  com- 
munity and  political  activity.  Low  participation  is  aim  found  hr  tnc  Maritime 
Provinces,  where  the  farmers  generally  hive  a love  standard  uf  living,  but  do 


not 

c::pei  u 

M.r 

e the  cl 

non 

tC  i 

alter 

uatlo 

i . 1. 1 

n between  veea.ti; 

I and 

p 

over  tv  of 

the 

fan 

ncr  s 

of 

S.v 

;kaic!,cv 

can. 

Havi 

* i 

1 >Oi 

a led  ih 

> e 

p-o. 

>;vo 

sources  of  high  pen  tie 

■patio 

■ 

by  comp;. 

ring 

the 

..OCl 

.'.Is; 

rt.v 

rture  of 

S.v1 

. a c< 

|C  »>’ 

an  v; 

irhtbntof  ar^asof 

I.)-..-  j 

■snru  eipativi 

. vc 

can 

no  x 

-■  L 

ok 

for  oth 

' r y 

rea 

S i i 

*ch  k 

:\c  wpially  fave: 

-1>v.  „ 

...  .w  r 

» a. ; 

rcri'S  t'f  k 

. * , i 

diet 

, ac  t . 

.*  r : s r 

Ics, 

, The  nr. 

LMxori 

1 1 V* 

eli  c cal- 

-bell  areas  t:f  Xor: 

h l).i! 

tee 

:.i.  and  Alt’ 

hav 

j vi 

1 1 u . ■ 

dlv 

i):e  c a : 1 

I 

ie  o 

nr, 

1 ctw 

is:ics 

: small  pelitve.l  vn 

i:;-.  !i 

..  i 

e soci  J s:; 

;atl- 

f.C.v 

tion 

, an 

J h 

,igh!y  ui 

jl.* 

eeo: 

;o:n:e 

And  both  these  . 

areas 

fia 

read 

con 

until 

: i i ; y 

P* 

triicipat 

don 

thi 

!i  !oc 

il  uovc m nent  en 

d.  COC' 

ne 

rattves*  e.p.d  a 

re.n 

u*:e 

i>  t 

a d 

new 

pv 

>lit!c 

a!  iiunemJmts  wnen  ce- 

12  4 l c#!i 

1 by  ecen; 

>mtc 

Cl  i ' 

!s/i 

b.e  « 

nan. 

x struct 

uial 

.1, 

if  aetcrist: 

cs  vcec  found  In  >, 

Mnbc 

i , 

» j . 

i in  the  Kv 

iO’s, 

a r » 

ebb 

h rl 

m »■ 

the  ?.;■ 

1 

arlau 

pcditica!  moveav: 

».l  cv« 

. 

,d  widesp; 

read 

pan 

t:cip 

‘ ' 1 1 *, 

when  ] 

k r 

* 1 
. 1 1 0 

i'.C 

: - enene.Cvl 

h the 

cu 

.■vemp,  *e.: 

t cl 

a la 

,r;;o 

urb 

*n 

coiner  n 

• 1 

VPv.l 

an 

^r; 

'ee-cl 

. ,v.*Csi>:ci?Hv*j 

Sefvi 

c-: 

?,  and  due 

. . ,iy 

th. 

1 ► 

Cl  I ' 

j i'  ■'  i i 

1 1 c 

t ton  cl 

‘ • • 

lure 

1 ■ X 

v;i,a. 

, t'v..  cotv.pl 

h 

: y x 

me^uce  or, 

i 

k tae 

Y.  £ : *: 

:at  crop 

V • ■ 

v s.s  v>m 

'tier 

;;  v. 

! x.  i : i i 

in  p-a 

::i\-  fvll  0!f.  Mv 

rtoba 

,s  t-:vv;.' 

natural  A orc-av.:- a;.;. 

l.rvci  i that  ;h ; same  p-tterr.  oi  stracrirJ  mv;  :e:o:ics  v.wcn  c>:;u^ 

ir,  S..'L  i\’.  y e c a!*o  be  fmave  :n  co*:  r.iriutiwi  tar  removed  i * - ■ t a too  spe- 
■ .*  . ,r  . « , , . i , . . 1 , , ' . . . , . v , j . . \ ...  » . : ■ . . : i . y ’ 


c.'.A 


y iww  :*.:a  . y \ or..  t>.  ana  :.v-.v. 
t.\  ■;  omh:  : ■ . h cyv:  w.v 


\: 

pcmnc 


o 

ERIC 


Allen  H.  Barton  and  Paul  F.  Lazarsfeld 


relive  than  r.cbrhboi  mg  urban  areas;  its  widespread  political  par ticip.uioti 
resembled  that  of  die  Sadiatche v.v.n  farmers  (pp.  3C3-3C.4).  The  generalization 
of  explanatory  factors  from  “the  whe.u  economy”  to  attribute*  applicable  to 
any  community  obviously  opens  up  a much  wider  range  of  cases  ior  use 
in  comparative  analysis. 

This  comparative  ran  lysis  of  areas  of  high  and  low  participation  can  he 
summarised  in  the  following  scheme: 


Scheme  of  I actors  Accounting  for  Political  Participation 
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techniques,  but  a supplement  :o  them  v Ik  ■ dealing  with  situations  beyond  the 
pow  er  of  available  rational  technology,  which  created  severe  emotional  strain. 

Of  course  ch?  me  of  companions  of  small  numbers  of  cases  as  te>ts  requires 
great  camion;  care  must  be  taken  to  see  that  other  significant  factors  are  in  fact 
equal,  and  that  cases  are  selected  in  an  unbiased  manner. 

IV.  MATRIX  FORMULATIONS 

Sometimes  the  analysis  of  qualitative  observations  confronts  a mass  of 
particular  facts  of  such  great  number  and  variety  that  it  seems  quite  unwork- 
able to  treat  them  individually  as  descriptive  attributes  or  in  terms  of  their 
specific  interrelationships.  In  such  a situation  the  analyst  will  often  come  up 
with  a descriptive  concept  on  a higher  level  which  manages  to  embrace  and  sum 
up  a great  wealth  of  particular  observations  in  a single  formula,  fake  for 
instance,  Ruth  Benedict’s  description  of  the  Zu;ii  Indians,  which  mentions  their 
avoidance  of  drugs  and  alcohol,  their  lack  of  personal  visions,  their  placid 
response  to  divorce,  their  “mild  and  ceremonious”  relation  to  their  gods  and 
so  on.  After  presenting  a great  many  <u:h  particular  iacts,  Benedict  is  able  to 
sum  them  up  in  a single  formula:  The  Zuni  culture  has  an  Apollonian  pattern  - 
that  is,  a central  theme  of  avoidance  of  cmotior.il  excess,  i hts  pattern  or  theme 
permeates  every  aspect  of  Zuni  life1.  Such  a formula  capable  of  summing  up 
iii  a single  descriptive  concept  a great  wealth  of  particular  observations  may 
be  called  a matrix  formulation,  film  demotion  coveis  the  notion  of  a “Bade 
pattern”  of  a culture,  a “theme”,  an  “ethos”,  a “/eitgekt”  or  “mentality  of  the 
time”,  a “national  character”,  and  on  the  level  of  the  individual  person  a 
“personality  type”. 

Matrix  formulations  may  be  applied  to  complex  units  at  any  level.  In  a 
study  o)  an  unemployed  village  in  Austria,  the  researchers  made  a collection 
of  separate  “surprising  observations”.  Although  they  now  had  more  time,  the 
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the  people  h.u!  nothing  ^ ; 

kind  of  general  pm-dysb  of  mental  coerced.  do.;„tulcJ  iXS  “in- 

la  a study  of  a Particular  group  - | ^ prt,Wcnl  of  ex- 

flucmials’’  in  an  American  ^omnium  / 1 ..  e.-t-  tion,  proved  of  no  avail  in 

plaining  their  diverse  behas.ors.  ‘n av-,iJAme.  The  particular  be- 

jr*5  w!  fct»J  -'thVrciji  or  new,  rapine.  - 

havior  on  v.tucn  tlit  ..^ean  , , 0r  H[:.-rere  imoressions 

remained  unexplained.  In  trying  *?  -ifa  one 

not  closely  related  one  10  the  others  m i ^ ^ ^ w 

general  “theme"  winch  du  mgtmW  u-  ^ ^ commu,uty,  while 

politan”,  prhnai dy  interested  in  th.  • ••  • , - { Uv  jt;?K.  Merton 

others  were  "local”,  primarily  mteresteu  .u  uk  lo ■> ■’  : wlwn  be 
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in  causal  relations  and  process  with  one  another1.  *\  lie  element  of  “reciprerad 
mm”  for  example  can  he  core!  J .-red  as  grov/ine,  ou:  of  the  element  of  “pro- 
longed  face-to-face  association  with  thn  ;.me  people”,  ns  can  a great  many  of 
the  other  clvArnaeristics  of  a “Gen  ♦cinsrha  (V’-sUuation.]  n the  m at  v!>;  formulation 
of  an  “anal  personality”,  the  behavior  cha;  v.cccrhtici  are  thought  o\  ns  nil 
arising  out  of  n single  bade  faetor,  the  fixation  of  ti  c ciotic  development  nt  a 
certain  ddldidi  stage.  In  in  any  of  the  culture-pattern  formulations  m nee  ISene- 
diet's  purely  descriptive  ones,  v/«  find  the  idet  that  all  of  the  elements  in  the 
pattern  are  products  of  the  jutuin  of  child  trod  mug  - or  even  of  a single 
ekvierr  In  the  child  training  pattern.  Some  matrix  iormuhuions  invoice  a 
nuxtiiioof  descriptive!/  related  ni:d  causally  related  demerits. 

Yet  another  way  in  which  matrix  formulae  ca:i  be  differentiate J could  be 
called  their  “projective  dm/mce”.  The  following  example*  should  inddeate 
what  is  meant  by  this  dimension.  In  Mermr.'s  stud)  oi  it  was 

found  tli at  a vide  vr nr-./  of  ivmcrv;  made  by  the  Kite  Smith  Jevot.vs  could 
be  suir4nnd  i:p  in  the  notion  of  “subnnssh  ^:.e*s tothest.uus tpi'/’n  1 hey  believed 
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iVuM'ti*  gives  as  crmmph-s  of  secondary  descriptive  schemes  typologies  of 
social  relation*!. ips,  pecmi.dity  type?,  and  descriptive  categories  applied  to 
groups.  While  Parsons’  concept  is  not  entirely  identical  veith  that  of  the  matrix 
formulation,  it  illustrate*  mudi  of  the  reasoning  behinJ  such  complex  de 
scriptivo  "concepts  which  sum  vp  and  render  manageable  ^ large  at  d varied 
body  of  individual  points  of  data. 


V.  QUANTA!  I Vi.  SUPPORT  OK  THI.ORY 

So  far  v.e  have  main!)  discussed  v.  ays  in  whiui  cjmbtatisv  data  can  con- 
tribute to  the  formulation  of  psoblens  cl\>di  cations,  and  hypothecs.  Quali- 
tative Material*  are  particularly  suitable  fo:  t!  i;  exploratory  pha?e  of  re -card:: 
their  wealth  of  detailed  descriptive  element?  gives  the  emdys:  the  maximum 
opportunity  to  find  vine?  co  d sugjmdo.:?.  ]’or  testing  hypotheses,  on  the  other 
band,  the  ideal  model  vvrM  be  the  controlled  experit  tent,  with  precise  meas- 
urements on  a lim’tcd  number  of  preselected  variable*. 

*lhe  me  of  controlled  cvp?rim:r.ts  in  social  science  is  me  re  olive,  but  it  re- 
main* severely  broited.  Ueceru  years  1 .we  ed  a seen  a greet  dewlopme*.!  o.1 
ejuant  b alive reg  ard  . employing snm  re;  yh  approv.n  \t»;  :;s  of  the  s .perirtent  *1 
design  as  the  conwolbd  observation  of  r.v.iir.l  processes,  or  the  correlational 
am!;,  d,  of  cro.-s-s.ctlona!  sruyf.  The*e  technique*  p.ovid:  ten*  for  certain 
thcoii-;*.  \herc  u tain,  ho*.  ?m;or  ateas  in  >V :vh  t’norio  ate  supported 
maiutv  by  cgj  -h*  • tivc  date. 

i he  Crt'Ki.  »i if  fh^  /'/cos  o/  5.vr / *?.ff 

'Ihk  word  ‘ theory’*  hi*  actmPy  a number  of  di'Tcrc;*!  meanings  r.v  gi^e 
frori  bro  ;J  general  orient  .tin*  to  precise  prop  ad  ?*  v.sh  The  theories  for  which 
cj .1  \! ; : a o \ c >cpto.t  is  TiiO  t vfc’i  arc  ralati wly  large-sr  ue,  wjde-tangmg 
?vsti*.i*or  rcliti  * ! p I - r tA.voph.  large-#  ode  theories  o»  social  eh  a age  ?..u-T 
r«. I)  rp  m the  cu  dhuive  dmt  rf  r.lsiorical  record*:  tb.^Ie*  of  th:  ftwuoT-*. 
of  oj  caria  :v!cn*  ;a,J  iroit.olon  ■ ..  . booed  largely  or*  i\.  d. : ***ve  c:  wrlptio:.*; 

h.  C V.e 


r.c  *i  a 
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drew  on  ; yjpvp.es  from  literature  ;.s  well  as  roll  caso-h:v.or:es  . nut  Tor  t;.e  u>i 

ve  have  in  mind  it  is  important  tlut  the  examples  are  real.  Since  they  are  not 
sysicmadecJly  samples  01  puviwiy  measured,  they  do  rot  cuur  rigorous  proof 
in  any  statistical  or  experimental  sense.  Yc:  according  their  tmmo.w,  image, 
and  tchatior  to  the  reader *s  oven  experience  they  olTer  varying,  degrees  of 
suppeit  oi  corroboration.  It  is  this  function  which  vec  widi  to  examine. 

PsyJioanalyiis,  Marxism,  and  other  theories  of  history  in  genera!  or  of 
overall  pemorulhy  development  arc  lir  e and  complicated  .structures.  The  use 
of  qua!  tat i vc  da.ti  in  supporting  such  cotnph.x  theoretical  systems  Is  one  of 
the  major  undeveloped  area;  for  methodological  analysis.  In  order  to  l:rd 
manageable  illustrations  of  the  use  of  qualit.uiae  Jita  t > support  theory,  we 
will  rc?uict  ourselves  to  a much  simpler  type,  whuh  may  be  called  ‘'trend 
theories1’,  These  are  theories  which  call  attention  to  one  particular  trend  in 
society,  usually  derived  from  sonic  underlying  change  in  the  econonre  or  demo* 
graphic  stiuciurc. 

Vc  a ill  conside.  three  such  theories.  Frida  I'roram  I.as  suggested  that  a major 
Kiulcnca  on  our  time  is  "sclfstlienattoji".  resulting  from  the  ira<ecuiv.;cs  and 
disruption  of  social  bonds  brought  about  !*»  the  ti;co‘  the  market  economy  and 
ii  Ju:tr*.a’i/.itton.  C,  Ydnght  Mills  suggests  that  tN  ns;  r{  th?  big  c«ty  aud  the 
StatiJ.udi?. at! :n  of  t stc$  subjects  inercaduv;  :;u*  voers  oi  people  to  status- inse- 
curity. I ns  swell  propor:<  the  "developmental  coiiatnKt"  of  the  garnf.  -1  state* 
whtch  at  temp;*  to  work  out  the  logical  impheati*  n ol  the  tendency  toward  an 
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mass  ozgnui? atiev  yd  r-.cntri^g  a xtf>,  develop5  ‘hautoonnu-.n  ccttformiy*  as  a 
media  : .n  of  ewa.pc: 

#,TMs  p.ru.  * * - *dv.r  bn  U the  «v  ;uon  thr  t1  «.  rr  a *:*!■;  cf  r er  r'd  ind.vi* 
d-.d,  - i in  ruhn  o.kv.  T*  pvt  it  brink,  in  *kh’  d cwu  t-  * . FVse!:. 
K%  i h i :>  r:rdv  d c hi  d cf  uvd’tv  c W tv  Fr~  l y < V r.d  ; : cars;  • d 
hr  thwcfcrc  V cow,  t.ocs'v  as  a11  v\;s  arc  and  as  th:y  , ,t  5 hr  t-h  d ’ 


■ r .*  > l\-' ' h'J 
• I *■;  : : - . f 


r:  * . pm  h i v < v 

• v .v  ; -V*  m:  d.' 


^7 


Allen  H.  Barton  and  Faul  F.  Lazarsfcld 

us  ce  l.ov  ] romm  *.ve>  .veo.ti  vapyurtin  * :M«c*vut  ntlor.  derived  l:o:.t  l.:% 
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INTRODUCING  ANOTHER  VARIABLE 


One  of  the  most  important  tasks  In  the  analysis  of  data  is  to 
discover  what  we  can  about  cause  and  effect  relationships.  This 
does  not  mean  that  we  can  conclusively  prove  our  assertions, 
since  they  often  turn  on  assumptions  which  have  not  yet  been  tested 
or  which  are  untestable  for  all  practical  purposes.  But  we  still 
must  be  as  explicit  as  we  can,  showing  the  logical  structure  of 
our. arguments  and  the  points  at  which  empirical  data  are  rele- 
vant. It  is  the  nature  of  this  task  which  is  discussed  by  Pro- 
fessors Goode  and  Halt,  using  terms  such  as  intervening  vari- 
ate, antecedent  variable,  specification,  and  so  on,  which  arc 
part  of  the  working  vocabulary  of  the  social  sciences. 


From  William  J.  Goode  and  Paul  K.  Hatt,  Methods 
in  Social  Research,  Copyright  1961  by  McGraw- 
Hill  Book  Company.  Used  by  permission  of  McGraw- 
Hill  Book  Company,  pp.  313-358. 


Ylilliam  J.  Coodc  and  Paul  K.  Kat t,  Mct:l.ods  in  Social 
Research  (Kew  York:  KeGrav-Hi  11 > 1952)  , pp  313-358. 


A main  thread  of  thought  running  through  this  volume  is  that  sociology 
rests  upon  the  same  foundations  as  do  all  other  sciences.  Its  assumptions 
regarding  the  empirical  wot  Id  ate  the  same,  and  the  precautions  which  the 
sociologist  must  observe  in  tallying  out  icsearth  tmnt  also  be  observed  by 
scientists  in  other  fields.  We  are,  then,  at  the  beginning  of  an  era  in  which 
the  conscious  application  of  scientific  methods  to  sociological  problems 
should  be  marked  by  great  advances. 

This  premise  is  also  based  upon  a fundamental  assumption  of  sociology, 
constantly  borne  out  by  daily  observation,  that  there  is  an  orderliness 
within  social  phenomena.  Just  as  the  psychiatrist  finds  a symm  or  struc- 
ture within  the  seemingly  most  incoherent  expressions  of  a selir/ophrcnc, 
so  can  the  sociologist  find  predictability  and  order  within  the  most  anomic 
or  disorganiied  strata  and  social  groups.  The  intimate  spontaneity  of 
friendship  groups  exhibits  this  outcilincss  no  lc.-s  than  the  impersonal 
memoranda  within  a bureaucracy.  Indeed,  it  the  behavior  of  other  mem- 
bers of  socict) 'were  not  predictable,  it  would  not  be  possible  to  communi- 
cate, interact,  or  maintain  any  of  the  common  enterprises  in  which  we 
take  part. 

It  follows,  then,  that  modem  research  must  reject  as  a fabe  dichotomy 
the  separation  between  "qualitative”  and  "quantitative”  studies,  or  be- 
tween the  "statistical"  and  the  "nons'atimtal  ’ approach.  The  application 
of  mathematics  io  sociology  docs  not  ensure  rigor  of  proof,  any  more  than 
the  use  of  "insight"  guarantee?  the  significance  of  the  trsearcb. 

The  fundamental  questions  to  a«k  about  at!  research  techniques  are 
those  dealing  with  the  precision,  reliability,  and  relevance  of  the  da ta  and 
their  analysis:  (1)  how  precise  arc  the  observations?  \l) tan  other  scientists 
repeat  the  observations?  and  (3)  do  the  daf*t  actually  satisfy  the  demands 
of  the  problem,  that  is,  do  they  actually  demonstrate  the  conclusion?  If 
the  observations  are  crude,  casting  them  in  a statistical  form  will  not  help 
the  research.  If  other  scientist'  cannot  repeat  them,  mathematical  manipu- 
lation is  futile.  If  the  data  do  not  satisfy  a rigorous  logit  of  proof,  the 
conclusion  remains  doubtful. 

Furthermore,  no  nutter  how  precise  measurement  may  fcc,  that  uhick  ir 
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measured  remains  a quality*  Quantification  simply  achieve*  greater 
precision  and  reliability  in  Measuring  the  qualities  which  arc  considered 
important- intensity  of  anti-Semitic  altitudes,  degree  of  social  cohesion, 
conformity  with  moral  rules,  cic\  The  process  of  achieving  precision  leads 
to  the  clarification  of  ideas  and  helps  to  recast  substantiv  knowledge,  but 
in  a fundamental  seme  the  research  may  nevertheless  be  exiled  qualitative. 

Similarly,  the  most  "qualitative”  of  social  research  attempts  rough  mcas- 
urement.  The  historian  speaks  of  "a  growing  amimonarcliical  feeling 
during  the  reign  of  Louis  XVI."  The  anthropologist  contrasts  the  intensity 
of  emotion  aroused  by,  say,  the  murder  of  a kinsman  as  against  the  murder 
of  a tribal  enemy.  The  economist  may  single  out  those  who  find  the 
security  of  income  fir  more  important  than  the  promise  of  great  rewards 
when  they  arc  considering  the  choice  of  an  occupation.  The  sv  biologist 
comments  that  the  individual  who  is  well  integrand  within  a strongly 
cohesive  group  is  better  able  to  withstand  emotional  shocks  than  other 
individuals.  In  each  of  these  statements,  there  arc  implied  measurements 
of  important  qualities.  We  may  or  may  not  lie  satisfied  with  the  degree 
of  precision  in  these  notions,  but  they  are  essentially  attempts  to  measure 
the  effect  of  different  variables.  It  is  equally  clear  that,  when  data  are  suf- 
ficiently precise/  statistical  techniques  can  simplify  the  task  of  understand- 
ing them.  Such  techniques  aie  aids  in  research,  just  as  good  methods  of 
recording  data  can  be,  and  they  should  be  used  whenever  the  problem 
permits  it. 

Thus,  the  incitasing  use  of  statistics  is  not  the  distinguishing  feature  of 
modern  social  research.  Rather,  it  is  the  increasing  precision  and  reliabil- 
ity of  research  techniques,  and  higher  standards  of  proof,  which  have 
made  the  use  of  statistics  more  fruitful.  In  ttn n,  the  increasing  fiuit  fulness 
of  statistical  manipulation  hns  stimulated  further  developments  in  both 
sampling  and  statistical  theory;  while  the  needs  of  modern  statistics  have 
stimulated  still  greater  pi  cc  is  ion  and  reliability  in  the  collection  of  data. 

These  developments  have  occur  ted  on  many  fronts,  such  as  the  better 
identification  of  the  important  social  variables  increased  precision  of  the 
questions  used  in  schedules,  and  a better  grasp  of  interviewing  techniques, 
among  others. 

However,  most  of  this  growth  may  classified  as  techniques  for  stimu- 
lating responses,  or  obtaining  observations,  which  are  easier  to  categorize. 
Mote  precise  questions  allow  the  answers  to  bs  analyzed  more  easily.  The 
mastery  of  interviewing  permits  deeper  probing  in  the  search  for  more 
precise  answers  from  respondents.  What,  then,  of  those  sources  of  data 
which  have  not  been  structured  previously  b)  such  techniques?  Most  of 
our  daily  observation  and  experience,  the  newspapers  and  magazines  we 
read,  the  radio  programs  to  which  we  listen,  as  sveti  as  historical  records 
and  the  reeoi  led  protocols  of  psschiatric  and  other  depth  interviews,  are 
all  essentially  "unstruciurcH*  but  may  be  important  sources  of  data  for 
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certain  sociological  problems.  If  such  data  are  eliminated  from  consid- 
eration, the  range  of  information  available  is  narrowed  and  much  of  the 
richness  of  social  experience  may  be  lost.  On  the  other  hand,  if  they  are 
utilized  as  they  occur,  little  order  appears  in  them,  and  few  fellow  scien- 
tists might  agree  to  any  one  interpretation  of  them. 

Techniques  are  therefore  being  developed  which  permit  us  to  order 
ar.d  analy;*  such  data.  Since  most  of  these  procedures  are  really  ways  of 
classifying  data  which  were  not  originally  created  for  research  purposes, 
the  term  qualitative  coding  is  usually  applied  to  them.  When  qualitative 
coding  is  applied  to  the  content  of  various  communication  media  such  as 
magazines,  newspapers,  radio  programs,  or  similar  materials,  it  is  called 
content  analysis.  Since  the  most  effective  application  of  the  care  Airfory  or 
case  study  to  social  research  depends  in  large  part  upon  qualitative  cod- 
ing, that  special  problem  is  included  in  the  following  discussion. 

Succeeding  sections  will,  then,  deal  with  these  subjects:  (1)  simple  coding 
operations,  (2)  qualitative  coding,  (5)  content  analysis  as  one  application 
of  qualitative  coding,  and  (4)  the  case  study. 

SIMPLE  CODING  OPERATIONS 

The  student  may  feel  that  there  is  little  point  to  a discussion  of  coding 
operations,  since  they  are  usually  bracketed  with  large-scale  suiveys. 
Although  it  is  true  that  such  surveys  almost  always  code  the  materials 
gathered,  the  student  may  find  it  profitable  to  consider  whether  a small 
project  might  also  benefit  from  coding  the  data.  If  the  class  attempts  a 
joint  project,  or  breaks  into  several  project  groups,  coding  may  be  the 
most  effective  means  of  handling  the  data.  A brief  explanation  of  this 
tool  will  tdlow  the  student  to  make  a decision  concerning  the  use  of  coding. 

HTrcrt  fo  code.  Coding  is  an  operation  by  which  data  are  organized  into 
classes,  and  a number  or  symbol  is  given  to  each  item,  according  to  the 
class  in  which  it  falls.  Thus,  counting  the  symbols  gives  us  the  total 
number  of  items  in  any  given  class.  The  basic  operation,  of  course,  is 
that  of  classification.  Aligning  the  number  or  symbol  to  a given  datum 
then  becomes  a mechanical  procedure.  How  to  classify  must,  naturally, 
depend  upon  the  questions  which  have  been  asked  and  the  concepts  which 
are  used  in  the  particular  research.  These  problems  are  discussed  in  the 
earlier  chapters  of  this  volume,  and  examples  ate  given  in  this  chapter. 
Let  us  now  ask  the  practical  question,  "When  is  it  profitable  to  code?" 

The  answer  depends  mainly  on  three  variables:  (1)  the  number  of 
respondents  or  sources  of  data  in  out  study;  (2)  the  number  of  queifioni 
asked;  and  (3)  the  number  and  complexity  of  itoftjfiraf  operations  planned 
for  the  study.  If  the  number  of  cases  is  large,  any  kind  of  tabulation  be- 
comes difficult  unless  the  data  are  coded.  The  individual  sheets  for  each 
rase  become  sepatated  aftet  a few  shufflings,  or  they  become  torn  and  worn. 
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Small  errors  in  counting  the  answers  require  handling  all  the  sheets  once 
more.  By  the  use  of  coding  procedures,  however,  retabulations  may  be 
avoided  or  minimized.  f(  the  number  of  questions  is  large,  the  same  set  of 
considerations  holds. 

Most  important,  however,  there  is  no  easy  way  to  carry  out  complex 
cross  tabulations  without  some  form  of  coding.  Any  statistical  operation 
requires  the  manipulation  of  numbers,  which  in  turn  must  represent  the 
data  from  the  schedule.  It  is  possible  cither  to  make  many  piles  of  sheets 
for  each  comparison  or  analysis,  or  to  give  numbers  to  the  answers  and 
summarize  these  separately  on  other  sheets  or  cards.  In  the  latter  case,  the 
operations  are  much  simpler.  The  more  complex  the  operations  planned, 
the  more  useful  is  some  form  of  coding. 

At  ivhat  stage  to  code.  Coding  can  be  cairied  out  at  any  phase  in  the 
study,  from  the  interview*  itself  to  the  period  just  prior  to  the  tabulations. 
In  the  section  on  formulating  the  questionnaire,  mention  vas  made  of 
prccoded  questions.  These  were  questions  which  had  already  been  field- 
tested  (or  meaning  and  range  of  possible  answers,  and  which  were  physi- 
cally set  up  on  the  schedule  so  tint  checking  the  answer  automatically 
coded  the  data.  An  example  would  be  the  following: 

1 2 

Are  you  a veteran  of  World  War  \l)  (Circle  answer)  ~ 

Answers  which  arc  set  up  in  this  fashion  can  be  tabulated  very  easily  by 
hand,  or  they  may  be  punched  directly  onto  cards  for  machine  tabulation. 
In  this  case  the  interviewer  is  actually  coding  as  he  goes  alone,  although 
no  separate  operation  is  required. 

Similarly,  the  interviewer  may  be  asked  to  do  the  coding  as  soon  as  he 
hears  the  answer.  This  can  be  done  in  a fairly  cautious  fashion,  as  an 
almost  automatic  operation: 

How  did  your  husband  fed  ?.bout  this?  I.  Strongly  approved;  2.  Mildly 
approved;  5.  Indifferent;  Mildly  disapproved:  5.  Strongly  disapproved.  2 

When  the  respondent  answers,  the  interviewer  has  only  (o  mark  the  proper 
number  in  the  coding  margin. 

Suppose,  however,  that  the  goal  is  to  classify  respondents  in  terms  of 
annual  income , while  many  of  them  are  laborers  who  typically  think  of 
their  ea.nings  in  terms  of  daily  wages.  It  would  be  possible  to  ask  them 
for  their  total  annual  earnings  directly.  This,  however,  might  cause  them 
to  make  important  errors  in  arithmetic.  Second,  the  interviewer  might 
cany  out  the  operation  mentally,  coding  the  totai  earnings  in  its  proper 
class  after  calculating  daily  earnings  times  the  total  estimated  number  ol 
days  or  weeks  worked.  Although  this  is  a simple  procedure,  experience 
sliow-s  that  even  the  best  inteivieveis  may  make  vrrora.  Most  important, 
however,  tnere  is  no  way  of  discovering  the  error,  s»nce  the  only  figure 


collected  by  the  Office  of  United  States  Courts.  " As  a result,  we 
cannot  be  sure  if  the  cases  tried  by  one  method  are  the  same  as 
those  cases  tried  by  another — and,  as  a consequence,  we  cannot 
be  sure  if  any  difference  in  outcome  is  due  to  the  difference  in  the 
method  of  trial  or  some  other  factor.  The  student  should  note  how 
the  authors  come  to  grip  with  this  problem. 

As  for  the  use  of  simple  mathematics , the  reading  makes  it 
clear  that  the  law  student  can  engage  in  important  research  withmt 
necessarily  using  formulas  for  correlation,  tests  for  significance, 
etc.  In  fact,  the  authors  rely  heavily  on  simple  percentages  and 
simple  logic — but  the  methods  fit  their  purpose  and  the  results  are 
more  than  adequate. 


From  Hans  Zeisel  and  Thomas  Callahan,  ''Split  Trials 
and  Time  Saving:  A Statistical  Analysis, " Harvard 
Law  Review,  76  (1963),  pp.  1606-1625.  Reprinted 
by  permission. 
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In  iqs Q the  United  Stoics  District  Court  for  the  yorthern  District 
of  Illinois  adopted  a talc  permitting  separate  trials  of  liability  and 
damage  issues  in  civil  cases.  At  the  request  of  tic  court  the  authors 
have  conducted  cu  investigation  to  determine  the  extent  to  which 
time  is  saved  through  use  of  the  separation  device.  They  use  a 
variety  of  converging  statistical  approaches  to  arrive  at  the  conclu- 
sion that  in  persona!  injury  jury  (rial  cases  about  twenty  per  cent 
of  trial  time  may  be  saved.  They  then  consider  the  possible  ways 
that  separation  might  lengthen  trial  time , and  con  elude  that  these 
will  not  offset  the  time  saving  to  any  degree. 

Among  the  devices  aimed  at  reducing  the  trial  load  of  our 
courts,  the  split  or  separate  trial  of  issues  has  been  proposed  as 
holding  special  promise.  Trial  time  would  be  saved,  the  argument 
goes,  if  the  issues  of  liability  and  damages  were  tried  separately. 
The  jury  would  be  required  first  to  bring  in  a verdict  on  liability, 
and  only  if  liability  were  found  would  the  jury  hear  evidence  and 
render  a second  verdict  on  the  issue  of  damages.1  On  the  recom- 
mendation of  the  late  Judge  Juliu«  H.  Miner,2  the  United  States 
District  Court  for  the  Northern  District  of  Illinois  was  persuaded 
to  test  this  device,  and  on  November  3,  1959,  adopted  civil  rule 
21,  set  forth  below,3  permitting  separate  trial  on  motion  of  a party 

t We  arc  happy  t«>  acknowledge  the  generous  support  0/  this  study  by  the  Wal- 
ter E.  Meyer  Research  Institute  of  Law.  This  investigation,  an  offshoot  of  the 
Unlver*ily  of  Chicago  L?.\v  School's  prior  work  on  the  jury  system  and  court 
congestion,  was  conducted  in  closest  collaboration  with  Professor  Han/  Xalvcn, 
Jr.,  Director  of  the  Jury  Project. 

* Professor  of  Law  and  Sociology,  University  of  Chicrgo  Law  School.  Dr.  jt;r.t 
University  of  Vienna,  1927,  Dr.  rcr.  pol.,  19:8. 

##  TJurd-ycar  student,  University  of  Chicago  Law'  School;  Research  Assistant, 
Jury  Project,  since  1957.  M.A.,  University  of  Chicago,  1962. 

1 Such  a pbn  was  proposed  in  Zeisll,  Kaivjn  & Rvcuholz,  Delay  in  t iie 
Court  99  (1959)  {hereinafter  cited  aj  Delay], 

2 Mirer,  Court  Congestion:  A .V ew  Approach,  45  A B.A.J.  1265,  1268  (1959). 
Judge  Miner  died  on  March  ii,  19^3,  the  very  day  this  report  was  submitted  to 
his  court.  The  cause  of  court  administration,  and  especially  its  students,  owe  him 
a great  debt  of  gratitude  for  bis  determined  efforts  to  pioneer  what  promises  to  be 
a unique!}  powerful  delay  remedy. 

3 X.D.  III.  (Civ.)  R.  21,  set  forth  in  2 Frn.  Rules  Sfrv.  20  104S-49  (itfo)* 
Pursuant  to  and  In  fuither&nce  of  Rule  42(b),  Federal  Rules  of  CU il  Pro- 
cedure, to  curtail  undue  delay  in  the  administration  of  justice  In  personal  injury 
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or  at  the  direction  of  the  court.  Though  the  rule  was  sponsored  in 
order  to  save  trial  time,  a commodity  in  short  supply  in  most  met- 
ropolitan courts,  attempts  were  soon  made  to  upset  it  on  the 
ground  that  this  major  departure  from  the  traditional  mode  of 
trial  might  affect  the  substance  of  the  verdicts  rendered  by  juries. 
But  the  Court  of  Appeals  for  the  Seventh  Circuit  held  that  the  rule 
preserved  the  essentia!  character  of  trial  by  jury  and  was  within 
the  ambit  of  Rule  42(b)  of  the  Federal  Rules  of  Civil  Procedure, 
which  permits  separated  trial  of  issues  in  the  interest  of  conven- 
ience.4 

Prior  to  the  promulgation  of  the  wile,  Chief  Judge  Campbell 
asked  (he  Law  School  of  the  University  of  Chicago  to  design  a re- 
search operation  that  would  measure  the  effect  of  separation  on 
the  trial  load  of  the  court.8  A later  report  may  deal  with  the 
above-mentioned  substantive  considerations;  the  present  report 
treats  only  of  one  aspect  of  the  rule  — its  impact  on  trial  time. 

The  way  in  which  separation  of  issues  may  save  court  time  is 
clear.  In  the  traditional  form  of  trial,  the  damage  issue  must  be 
litigated  even  where  the  verdict  will  ultimately  reject  liability; 
separation  would  eliminate  the  need  for  trying  the  damage  issue  in 
those  cases,  comprising  roughly  40  per  cent  of  all  personal  injury 
jury  trials.®  Even  where  liability  was  affirmed,  it  seemed  likely 

and  other  civil  lii Ration  wherein  the  issue  of  liability  may  be  adjudicated  as 
a prerequisite  to  ihe  determination  of  any  or  nil  other  Issues,  in  jury  and 
non-jury  cjscs,  a separate  trial  may  be  had  upon  such  issue  of  liability,  upon 
moiion  of  any  of  the  parlies  or  at  the  Court’s  direction,  in  3"y  claim,  cross- 
chltu,  counterclaim  or  third-party  claim. 

In  the  event  liability  is  sustained,  the  Court  may  recess  for  pre-trial  or 
settlement  conference  or  proceed  with  the  trial  on  any  or  all  of  the  remaining 
issues  before  the  Court,  before  the  same  jury  or  before  another  jury4  as  con- 
ditions may  require  and  the  Court  shall  deem  meet. 

The  Court,  however,  may  proceed  to  trial  upon  all  or  any  combination 
of  issues  if,  in  its  discretion,  and  in  furtherance  of  justice,  I* *  shall  appear  that 
a separate  iiial  will  work  a hardship  upon  any  of  the  parlies  or  will  result  in 
protracted  or  cosily  litigation. 

* Ho>ie  v.  Chicago  & XAV.  Uy.,  *Sj  F.rd  639  (7th  Cir.  i960),  r cri  dotted,  365 
VS.  £7  j (1961).  The  court  did  not  decide  whether  the  use  of  different  juries  for 
liability  and  damage  issues  as  authorized  by  the  district  court  rule  violates  the 
seventh  amendment.  The  district  court  has  confined  the  operation  of  rule  jr  to 
single  juries.  C/.  O’Donnell  v.  VW.son  Rros.  Transp.  Co.,  183  F.  Supp.  5?;  (X  D. 
III.  i960). 

6 Throughout  the  study  Judges  Campbell,  Miner,  and  Robson  acted  as  advisers, 
for  which  wc  owe  them  a great  debt  of  gratitude.  We  also  want  to  thank  the  other 
Members  of  the  court  and  its  clerks  for  their  meticulous  help  throughout  these  two 
years.  We  are  especially  Indebted  to  Mr.  Roy  Johnson,  Sp'dal  Assistant  to  Judge 
Campbell,  and  to  Miss  AiJcene  C.  Coldby,  Chief  Deputy  Clerk,  for  their  untiring 
operation.  Appreciation  is  also  due  to  Mr.  Orin  S.  Thiel  and  Mr.  Ronald  H. 
I1'- ally  of  the  Administrative  Office  of  the  United  Slates  Courts  for  their  thought- 
f d help  In  supplying  us  with  a series  of  control  statistics  from  their  office. 

^Trom  a nationwide  sample  of  civil  jury  trials,  which  forms  part  of  the  basic 
*’  U of  the  Jury*  Project  of  the  University  of  Chicago  I.aw  School. 
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that  trial  time  might  be  saved  by  the  stimulus  to  settlement 
before  the  damage  issue  is  litigated.  There  remained,  however,  the 
possibility  that  those  savings  might  well  be  offset  by  a number  of 
countervailing  factors  not  so  immediately  obvious.  Thus  the 
separation  of  issues  might  reduce  the  proportion  of  cases  settled, 
and  thereby  increase  the  number  of  cases  requiring  trial.  It  might 
reduce  the  ratio  of  jury  waivers  and  thereby  add  to  the  number  of 
cases  requiring  jury  trial,7  Separation  might  increase  the  propor- 
tion of  hung  juries,  and  necessitate  more  retrials.  Finally,  it  might 
simply  require  more  time  of  juries  overall,  since  in  some  cases 
there  would  be  two  deliberations.  The  crucial  question  for  this 
study  was,  therefore,  not  so  much  whether  but  rather  how  much 
time  would  be  saved  by  the  separation  of  issues. 

Because  our  inquiry  was  set  up  only  shortly  before  the  adop- 
tion of  the  separation  rule,  we  were  unable  to  acquire  data  on  the 
period  before  the  rule  which  we  should  have  wished  to  have  in 
order  to  compare  the  operations  of  the  court  before  and  after  the 
separation  rule  was  introduced.  Instead,  for  our  prior  control 
period  we  had  to  rc-ly  for  information  on  data  routinely  collected 
by  the  statistical  branch  of  the  Administrative  Office  of  the  United 
Stales  Courts.  The  ideal  research  design  would  have  combined 
such  a prior  observation  period  with  a selection  by  chance  of  the 
cases  to  which  the  separation  rule  would  be  applied.  Specifically, 
one  might  have  subjected  every  three  successive  cases  filed  to  a 
sort  of  lottery:  in  the  case  drawn  first,  separation  would  be  re- 
quired; in  the  second,  sepaiation  would  not  be  allowed;  in  the 
third  case,  separation  would  be  left  to  the  judge’s  discretion. 
These  groups  of  three  would  be  assigned  in  rotation,  giving  each 
judge  an  equai  number  of  cases  of  each  variant.  After  final  dis- 
position, the  separated,  the  regular,  and  the  cases  where  sepaia- 
tion was  discretionary  would  be  compared  and  measured  against 
the  period  that  preceded  the  introduction  of  the  rule.  But  this 
division  of  cases  was  of  course  not  feasible  for  a variety  of  good 
reasons.8  Instead,  all  that  could  be  done  was  to  keep  a careful 
record  of  the  court’s  actions  in  each  case  from  assignment  to  final 
disposition,  both  for  cases  in  which  there  was  separation  and  for 
the  regular  trials.  We  had  no  influence  and  of  course  did  not  at- 
tempt to  exert  any  influence  on  the  choice  of  cases  in  which  the 


1 The  New  York  experience  showed  that  roughly  40^  loss  time  is  used  up  In  a 
bench  trial.  Dklay  81. 

*C/.  The  Cme  lor  the  Official  Fxperir.ien!,  Dllay  141-50;  Zeisel,  The  Sew 
Vert  Expert  Testimony  Project:  Some  Reflections  on  Legal  Experiments,  S Stax. 
L.  Rtv.  750,  741-4?  (195S). 
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issues  were  separated.  Each  judge  applied  the  broad  discretion 
conferred  on  him  by  the  rule  according  to  his  own  understanding 
of  its  suitability  for  the  case  at  hand. 

I 

To  begin  the  analysis,  then,  we  present  a synopsis  of  the  civil 
cases  tried  before  the  court.  Table  i gives  the  proportion  of  cases 
tried  under  the  separation  rule  in  each  category.  The  table  shows 
that  the  relative  frequency  of  such  trials  varies  greatly  with  the 
type  of  action.  Two  categories  show  no  separation  whatsoever: 
admiralty  cases  and  tort  cases  other  than  personal  injury  claims. 
The  greatest  use  of  separation  was  made  in  personal  injury  trials: 
here  37  per  cent  of  the  jury  cases  and  27  per  cent  of  the  bench 
trials  were  tried  under  the  separation  rule.  Separation  was  ordered 

TABLE  1 

Relative.  Frequency  of  Separated  Trials  by  Type  of  Case  * 


jRfguter  Trials  Separated  Trials 


Percentage 

Number  Number  cf  Total  TOTAL 


Personal  Injury*.  b 

Jury 

XI7 

69 

37% 

186 

Non  jury 

22 

3 

27% 

30 

216 

Other  Tort: 

Jury 

10 

, , 

. , 

10 

Nonjury 

4 

4 

14 

Contract: 

Jury 

79 

6 

24% 

25 

Nonjury 

17 

2 

u% 

29 

44 

Federal  Tort 

Claims  Act: 

Nonjury 

10 

4 

29% 

I4. 

14 

Admiralty: 

Jury 

i 

, . 

1 

Nonjury 

6 

6 

7 

Other: 

Jury 

8 

8 

1 

Nonjury 

34 

4 

n% 

J8 

46 

TOTAL: 

Jury 

res 

75 

33% 

230 

Non  jury 

93 

18 

16% 

III 

243 

93 

27% 

311 

' For  ^.c  l\vo-ycar  period  after  introduction  of  rule  21 

(1960-1961), 

1 Indcdcs  16  jury 

and  6 nonjury  FEE  A efisc-s,  of  wldc'i  enc  jury 

trial 

\va» 

ftps  rated 
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in  only  one  out  of  twenty-two  FI5LA  cases  but  in  29  per  cent  of 
the  fourteen  Federal  Tort  Claims  Act  cases.  These  frequencies 
forced  us  to  limit  our  inquiry  to  personal  injury  cases:  only  this 
category  provided  a sufficiently  large  group  of  separated  (69)  and 
regular  (117)  trials  to  offer  an  adequate  basis  for  statistical  com- 
parison. It  would  be  improper  to  compare,  for  instance,  separated 
tort  cases  with  nonseparatec!  contract  cases,  since  the  validity  of 
any  comparison  requires  that  the  cases  be  kept  as  homogeneous 
as  possible.  Strictly  speaking,  therefore,  conclusions  of  the  study 
will  apply  only  to  personal  injury  jury  cases,  but  within  reason 
they  should  prove  applicable  also  to  other  types  of  cases. 

Table  2 proceeds  to  the  comparison  of  separated  and  regular 
personal  injury  trials.  The  1S6  trials  by  jury  during  the  obser- 
vation period  are  divided  by  tire  stage  at  which  they  terminated. 
We  distinguish  four  stages  for  the  separated  trials  and  four 
roughly  parallel  stages  for  the  regular  trials.  The  midpoint 
for  the  regular  trial  comes  at  the  end  of  the  plaintiff’s  case;  for 
the  separated  trial,  at  the  end  of  the  liability  trial. 


tabu:  2 

Stacks  or  Disposition  and  Time  Required  To  Reach  Them  5 


Separated  Trials 

Regular  TrfaU 

Sta^e  cf  PjVpositliin 

Days 

Ter  Cent 

St?.£e  of  Difpo.*3ii<in  Days 

Per  Cent 

During  Liability 
Trial 

1.9 

19% 

During  Plaintiff’s 
Case  2.0 

18% 

A*,  End  nf  Liability 
Trial 

62% 

At  End  of  Plaintiff’s 
Case  3.3 

4% 

During  Damage 
Trial 

3-6 

4/& 

During  Defendant’s 
Case 

After  Full  Trial 

4.0 

is?° 

After  Full  Trial  4.7 

78% 

Average  All 
Cases 

3.1 

100  % 

Average  All 

Cases  4.2 

1C0% 

Number  of  Cas 

es 

(69) 

Number  of  Cases  (117) 

•Personal  injury  jury  trials.  Only  court  time  is  considered;  deliberation  time  Is 
not  included.  Figures  in  the  “Days”  column  arc  averages  for  cases  disposed  of  at 
the  stage  indicated, 


It  was,  to  repeat,  the  expectation  of  those  who  favored  the 
new  rule  that  many  separated  trials  would  end  after  the  liability 
verdict,  thus  making  the  damage  trial  unnecessary.  As  can  be 
seen,  Tabic  2 vividly  confirms  this  expectation:  fully  7S  per  cent 
of  the  regular  trials,  but  only  15  per  cent  of  the  separated  trials, 
go  through  their  full  course.  And  since  many  more  regular  trials 
run  their  full  course  than  do  separated  trials,  it  must  be  expected 
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that  the*  average  trial  time  for  each  is  quite  different.  The  figures 
for  trial  time  in  Table  2 hear  this  out.  showing  that  the  average 
regular  trial  lasts  4.2  days,  the  separated  trial,  ?.i  days  — a time 
difference  of  26  per  cent.  It  would  be  tempting  to  read  this  figure 
as  the  looked-for  measure  of  the  time  saved  by  separation.  Un- 
fortunately, it  cannot  be  accepted  at  face  value  because,  standing 
by  itself,  this  comparison  of  the  separated  and  the  regular  trials 
could  be  quite  misleading. 


II 

At  (his  point  we  ask  for  the  reader’s  indulgence  during  a some- 
what complicated  methodological  exclusion  that  will,  we  trust, 
make  clear  why  without  further  investigation  we  cannot  trust 
ihe  comparison  made  by  Table  2,  The  precise  question  is  this: 
Were  the  cases  selected  for  separated  trial  and  the  cases  left  for 
regular  trial  substantially  similar  except  for  the  fact  of  separation? 
If  they  were,  we  may  attribute  the  observed  difference  to  the  fact 
of  separation.  But  if  they  were  not,  we  are  in  trouble.  It  is  the 
same  methodological  problem  that  came  to  the  fore  in  the  now 
famous  debate  over  whether  smoking  shortens  our  life  expectancy. 
The  average  age  at  which  smokers  die  is  lower  than  that  of  non- 
smokers,  just  as  the  average  separated  trial  is  shorter  than  the 
average  nonseperated  one.  Does  such  a statistic  prove  that  smok- 
ing shortens  life,  or  that  separation  shortens  trial  time?  Suppose 
it  is  the  man  who  is  less  healthy  to  begin  with  who  takes  up  smok- 
ing. We  would  then  falsely  attribute  to  smoking  the  effect  of 
shortening  our  lifetime,  when  in  fact  the  average  smoker  had 
a shorter  life  expectancy  even  before  he  began  smoking.  Simi- 
larly, one  may  fmd  that  deaths  are  more  likely  to  occur  in  the  near 
future  in  families  recently  visited  by  physicians  than  in  those  un- 
| attended.  Obviously  it  is  not  the  doctor,  but  the  preexisting  situa- 

| tion  that  calls  for  a doctor,  which  causes  the  higher  mortality 

i rate.  A similar  difficulty  may  apply  to  the  figures  in  '’.’able  2: 
cases  selected  for  separate  trial  might  he  different  to  begin  with. 
If  that  difference  is  itself  related  to  the  time  saving  we  want 
measured,  then  we  no  longer  know  to  what  extent  the  apparent 
time  saving  was  due  to  differences  other  than  separation.  That 
was  why  our  ideal  research  design  specified  selection  by  lot  — 
by  avoiding  purposeful  selections  we  could  avoid  the  possibility 
oi  selections  on  some  basis  which  might  have  a bearing  on  trial 
time. 

Wore  our  separated  trials  selected  in  any  way  that  could  lessen 


Hans  Zcisel  and  Thomas  Callahan 


the  validity  of  the  comparison  h.  Tabic  2?  We  know  that  there 
was  purposeful  selection  by  the  judges.  Though  wo  do  not  know 
the  basis  for  their  selection,  a few  statistics  give  us  warning. 
From  Table  2,  we  sec  that  the  average  separated  trial  which  actu- 
ally went  to  the  jury  on  the  damage  question  took  4.0  days;  but 
it  took  considerably  longer  — 4.7  days  — to  try  the  regular  cases 
that  ran  their  full  course.9  There  is  thus  some  indication,  though 
no  proof,  that  the  cases  differ  by  selection.  There  is  no  proof, 
because  the  figure  for  separated  trials  is  based  on  only  a small 
fraction  of  cases  that  go  the  full  course;  the  potentially  longer 
cases  might  well  have  been  disposed  of  during  trial.  On  this  view 
the  difference  would  disappear  if  all  trials  were  full  trials. 

Further,  the  figures  in  Table  3 suggest  that  the  two  groups  of 
cases  were  not  selected  at  random.  Only  34  per  cent  of  the  regu- 
lar trials  ended  in  favor  of  the  defendant,  in  contrast  to  56  per 
cent  of  the  separated  trials  — a figure  far  above  the  average.10  Xote 
too  that  the  proportion  of  directed  verdicts  is  larger  in  the  sep- 
arated group.  Finally,  there  is  evidence  that  the  regular  trials  left 
over  after  some  cases  have  been  removed  for  separated  trials 
differ  as  a group  from  the  run  of  regular  trials  before  civil  rule 
21,  when  no  cases  were  separated.  Statistics  supplied  us  by  the 
statistical  branch  of  the  Administrative  Office  of  the  United  States 
Courts  indicate  that  of  the  196  personal  injury  jury  trials  con- 
ducted in  this  court  during  the  two  years  prior  to  the  introduction 


TABLE  3 

Outcome  and  Mode  oe  Termination  of  Separated  and 
Regular  Triais  4 


Mode  of  Termination 

Separated  Trials 
Per  Cent 

Regular  Trials 
Per  Cent 

Settlement 

32 

24 

Jury  Verdict  for  Plaintiff 

12 

42. 

Total  in  Favor  of  Plaintiff 

' 44 

"CG 

Jury  Verdict  for  Defendant 

43 

3i 

Directed  Verdict  for  Defendant 

13 

3 

. Total  in  Favor  of  Defendant 

5G 

34 

Number  of  Trials 

(69) 

(117) 

* Fcrsonal  injury  jury  trials. 


•The  trial  times  for  the  regular  cases  are  somewhat  high  compared  with  statis- 
tics available  from  other  courts,  but  there  are  no  ground*  for  suspecting  that  this 
should  restrict  the  validity  of  our  findings.  Wc  arc  concerned  only  with  the  rela- 
tive trial  time  of  separated  and  regular  trials,  and  this  relation  should  not  be 
affected  by  the  g-.neraUy  longer  trial  time. 

10  See  note  6 suprt. 
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of  rule  2i,  only  62  per  cent  of  the  cases  went  to  a jury  verdict, 
as  against  73  per  cent 11  of  the  regular  trials  after  rule  21.  This 
is  another  suggestion  that  judges  might  have  been  more  likely 
to  order  separate  trial  for  those  cases  which  held  a promise  of 
not  going  through  a full  trial. 


in 

So  much  for  the  potential  infirmities  of  the  data  in  Table  2.  Do 
our  data  permit  us  to  move  the  analysis  to  more  secure  ground? 
Table  4 will  show  the  proportion  of  personal  injury  jury  trials 
tried  by  each  judge  under  the  separation  rule:  the  ratio  varies 
from  89  per  cent  for  Judge  A down  to  zero  for  Judges  K and  L. 
Though  this  varying  use  seems  at  first  blush  unfortunate,  it  is 
this  very  variation  which  will  help  to  solve  the  problem.  For 
clarification,  we  return  to  the  smoking  analogy.  Just  as  we  could 
not  dictate  when  judges  were  to  use  split-trial  procedure,  we  could 
not  select  youngsters  at  random,  ordering  one  group  to  smoke 
and  another  not  to.  But  we  can  expose  one  random  group  of 
smokers  to  “stop-smoking”  propaganda  and  keep  another  random 
group  of  smokers  as  a control  group,  unexposed  to  this  propa- 
ganda. We  could  then  compare  the  longevity  of  the  “exposed” 
group  with  that  of  the  control  group.  Any  difference  could  be 
attributed  to  the  “stop-smoking”  campaign. 

Let  us  try  to  sec  more  precisely  how  this  analytical  procedure 
bypasses  the  difficulty  that  the  decision  to  stop  smoking  will  not 
be  made  at  random,  but  more  likely  by  persons  who  are  in  some 
way  special  with  respect  to  smoking  and  with  respect  to  health 
in  general.  The  point  is  that  in  this  design  we  neither  know  nor 
need  to  know  how  the  self-selection  works. 

Let  us  assume  the  worst  possible  case  from  the  point  of  view 
of  selection  bias:  suppose  the  only  smokers  who  respond  to  the 
stop-smoking  campaign  were  those  who  started  smoking  only  re- 
cently and  who  arc  at  the  same  time  particularly  health  conscious. 
Let  us  assume  also  that  it  is  only  their  lengthened  life  span  which 
would  cause  the  average  life  span  of  the  exposed  group  to  in- 
crease. Even  this  extreme  self-selection  would  in  no  way  affect 
the  validity  of  the  proposition  that  reduced  smoking  increases 
life  expectancy.  The  reason  wc  would  not  be  concerned  with  this 
self-selection  is  that  we  know  that  in  the  control  group,  too,  there 
would  be  these  health-conscious  smokers  who  only  recently  began 
to  smoke,  in  about  the  same  proportion  as  they  occurred  in  the 

n Table  3,  second  column:  43  31  r*  73. 
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experimental  group.  And  we  should  not  care  if  some  of  the  con- 
trol group  stopped  smoking  on  their  own,  without  exposure  to 
the  propaganda.  For  it  is  quite  sufficient  to  know  that,  whatever 
the  control  group  does,  the  experimental  group  also  does,  in  addi- 
tion to  the  experimental  group’s  reaction  to  the  stop-smoking  cam- 
paign. 

Now  instead  of  two  groups  of  smokers,  lot  us  take  two  groups 
of  trials,  one  conducted  by  Judge  A*  who  separates  some  of  his 
trials,  and  one  by  Judge  V who  separates  none.  Since  the  court 
assigns  cases  at  random,  the  two  groups  of  trials  will  be  com- 
parable. The  only  differences  between  the  groups  will  be  that 
there  was  separation  in  one  but  not  in  the  other,  and  conceivably 
the  differing  expeditinusness  of  Judges  .Y  and  F.  If  the  latter 
factor  is  discounted,  and  if  the  average  trial  length  in  the  group 
of  cases  where  separation  sometimes  occurred  is  below  that  of 
the  cases  where  it  was  not  available,  we  are  justified  in  ascribing 
this  difference  to  the  separation.  One  would  then  expect  Judges 
K and  L,  who  held  no  separate  trials,  to  show  a higher  average 
length  of  all  their  trials  than  Judge  A,  who  separated  in  S9  per 
cent  of  his  trials.  In  reality,  of  course,  there  arc  not  two  judges, 
but  several  judges,  each  having  a different  propensity  to  separate. 
We  should,  therefore,  expect  this  proposition  to  hold  true:  the 
larger  the  proportion  of  separated  trials  for  the  particular  judge, 


TABLE  4 

Separation  Ratio  and  Average  Trial  Time 
in  Personal  Injury  Jury  Trials 


Judge 

Proportion  of  Cd'C* 
Tried  Vr.dci  ivs  le  2 1 
(Ter  Cent) 

Avcragf  Perish  of 
All  Trial*  Before 
This  Jud^e 
(Pay*) 

Xutnbrr  of 
Trial*  Bt-lcre 
This  Judge 

A* 

89 

3-2 

06) 

B 

51 

3-3 

(41) 

C 

50 

2-3 

(10) 

D 

38 

3-5 

(26) 

K 

29 

6.1 

(7) 

F 

20 

S-6 

(s) 

G 

17 

4.2 

(6) 

II 

M 

3-8 

(22) 

l 

7 

3-9 

(27) 

} 

7 

43 

(14) 

K 

0 

4-5 

(0 

I. 

0 

4-5 

0) 

1 If  mi/’ 

it  he  relevant  to  rcvcr.l  tint  Jud/o  A wsi  not  Jwl^ 

Miner,  the  pr<;- 

poncrl  of  tJ'.e  rule. 


-\ 

7 

o 
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the  shorter  the  average  length  of  all  his  trials*  In  cur  case,  we 
should  expect  Judges  .1  and  B}  who  have  the  highest  separation 
rates,  to  show  lower  average  trial  lengths  fur  all  their  trials  than 
Judges  /,  /,  /v,  and  A.  Table  4 presents  llit  pertinent  statistics. 
It  is  as  we  expected:  as  the  separation  ratio  increases,  the  average 
trial  length  decreases.  The  exact  nature  of  this  relationship  be- 
comes even  clearer  if  we  put  Table  4 into  graph  form  as  in  Table  5, 


TABLE  5 


Pm  CtST  Summation*  and  Trial  Time 


Ter  Cent  Separation 

O Judges  \n?h  1 1 use?  ot  fewer* 
Q J ixj^( $ with  more  than  t \ * 

Mn  aTI  r<»K.  uI  injt3r>'  fury  trials  wtvelhtt  of  fret. 

are  kkniintd  by  kltcr  as  ihcy  *j>pc>r  in  l>bk  4. 


Generally,  Tabic  5 shows  a band  of  dois  sloping  downward 
toward  the  right -band  borderline  and  thus  confirms  our  expecta- 
tion that  the  average  trial  length  decreases  as  the  proportion  of 
separate  trials  per  judge  increases.  As  will  be  noted,  the  individual 
joints  representing  the  various  judges  are  far  from  forming  one 
line  which  would  represent  the  relationship  between  separation 


/f 
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and  trial  length;  they  show  great  individual  differences.  These 
differences  derive  fro/n  two  causes.  First,  from  the  fact  that 
some  judges  will  try  their  cases  more  expeditiously  than  others; 
if  so,  tlieir  “points V will  lie  somewhat  lower  than  they  would  if 
they  tried  their  cases  at  average  speed.  Second,  from  the  fact 
that,  although  cases  are  assigned  to  the  judges  at  random,  some 
judges  will  he  assigned  more  time -consuming  eases  than  others. 
The  distorting  effects  of  both  these  factors  arc  automatically  re- 
duced to  the  extent  that  the  number  of  judges  (points)  glows, 
so  that  the  “slower”  and  “faster”  ones  balance  each  other  — pro- 
vided that  there  is  no  consistent  correla'ion  between  “fast”  judges 
and  their  propensity  to  separate. **  The  distorting  factors  ate  also 
reduced  to  the  extent  to  which  the  individual  judges  have  more 
cases  and  hence  are  likely  to  have  on  balance  the  same  average 
assortment  of  long  and  short  cases.  In  line  with  this  reasoning, 
wc  will  disregard  all  judges  who  had  less  than  fifteen  cases.  If 
we  then  look  at  the  five  remaining  Judges,  /,  //,  I),  B,  and  /l,  wc 
find  them  to  fall  nicely  about  the  straight  line  we  have  drawn 
in  Table  5.’*  The  two  crucial  points  along  this  line  arc  its  begin- 
ning and  its  end.  It  begins  (at  separation  zero)  with  3.$  days  and 
ends  at  the  right-hand  margin  (separation  100  per  cent)  at  3.0 
days.  This  line,  the  tmieh-lookcd-for  unbiased  estimate,  suggests 
that  if  the  court  had  tried  all  Us  personal  injury  jury  cases  ur.der 
the  separation  rule,  it  would  have  saved  o.S  days  out  of  3.8  days, 
or  21  per  cent  of  the  total  trial  time. 

IV 

An  additional  tabic  provides  further  insights  into  the  means  by 
which  lime  is  saved  through  the  separation  procc.-s:  Tabic  6 
compares  the  stage  and  mode  of  termination  of  tegular  and  sep- 
arated trials.  The  figures  front  Table  2,  telling  us  at  what  stage 
the  cases  terminate,  form  the  right-hand  margin  column  of  Table 
6;  those  from  Table  3,  mode  of  disposition,  form  the  last  lines  for 
rtgular  and  separated  trials. 


,ltH«  Is  h fact*  t*5vrarxe  rhl  luch  a cortthlfea  dcN  tot  tx A 
lu&cWat  r.uhiVa  of  rioftjury  Irak  fof  euH  |ud£t.  Of  mi  hit 

Iriit  lift*  to  rch  ir,  hijH  ptovSic  a In  tKs  injvtlsnt 
this  artiVd*  <f«>htt*  fr*n  a centre !kd  ctf.:riraei*t  frith  ill  OonccniUnt  teas*iit»f>‘t 
tiial  to  i)i! erratic  Via  htfilerrl  tVf  nsuit. 

11  If  \ht  Erse  biKti  on  afl  i?  itre  fvetive  of  lYr.t  of  Inals 

U a sonHfvS at  slope;  tbit  he  r'jtify  er^'f  paietilhl 

lawn*)  ^1*1  th  a:?,  he  c of  [>i§  inlym,  htrevff.  r'vei  es  g >2 

reason  to  vjHcvi  %JSA  *>e  prefer  3n*  is  Ektfr  to  t*  ckfer  to  t \t  *int  line'’  t>*t 
^irt;fce  frcr.i  a ttt.troDt-i  tt;*n:s:er!. 
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TABLE  6 


Stage  and  Mode  or  Disposition  or  Personal  Injury  Jury  Trials 


Srife  of 
Tir.iJoitlc* 

Fettled 

VerJicI  Ih 

r»rfrtd^t  Jury  VfiJfel 

*■ — — — — - for 

P;r(tte<j  Jury  rn  LMbrl.’ly 

V(r-?i<t  \ eriict  ird 

TOTAt 

A.  Regular  Trials 

During  Plaintiff’s 

Case 

18% 

, , 

. . 

18% 

At  Knrl  of  Plaintiffs 

Case 

3 ft 

t 7v 

, , 

Aft 

During  Defendant’s 

Case 

, , 

, , 

, , , 

After  Full  Trial 

3% 

Sift  A*ft 

78% 

Total 

21% 

3ft 

31%  42% 

100%.  • 

B.  Separated  Trials 

During  Liability 

Trial 

io%« 

9ft 

» » . , 

19ft 

At  End  of  Liability 

Trial 

»5%J 

Aft 

6ift 

Dnrine  Damage 

Trial 

4%* 

* * 

. • • • 

Aft 

After  Full  Damage 

Trial 

ift* 

. . 

nft 

15% 

Total 

32  ft 

13  ft 

13%  ~ 12% 

‘100%* 

1 it;  itUh. 

* f 9 trill?. 

'Stilled  before  jury  rd  Jtftcd  *ny  vtrdkl. 

* Settled  a flcr  }uty  \ cr<_''d  *i‘rr.iirg  liability  but  before  jury  returned  virdirl 

cn  Th»»  Utiline  n f'tr  ,-rnl  (tJ-M-f  j),  ftrrtft.it  ibcs«  In 

v.Mcb  ibe  jury  found  ft r itr  pliinliit  t*ft  liability  but  revtf  deliberated  on  tbt 
rjwli'.ft  f.t  dim»£<s. 


If  we  look  fail  at  regular  trials,  wj  see  that  18  per  cent  of  a!.' 
regular  trials  were  terminated  llirougli  settlement  duiing  ike  plain- 
tiffs case,  and  that  settlement  at  a later  stage  is  much  less  fre- 
cp.icnt.  Then  we  turn  to  the  figures  for  separated  liials.  Here  we 
see  eon  firmed  our  hypothesis  that  the  bulk  of  cases  disposed  of 
at  the  end  of  the  liability  trial  terminate  in  a verdict  denying  lia- 
bility: of  the  f>2  per  cent  terminating  it  that  point,  47  or  roughly 
three-fourths,  end  with  verdicts  for  me  defendant.  Sut  the  un- 
expected element  toncerns  those  eases  in  which  the  jury  finds 
for  the  plaintiff  on  the  issue  of  liability.  We  fir.d  an  increased  will- 
ingness cn  the  part  of  defendants  to  settle  once  they  have  lost  the 
liability  Issue.  Fifteen  per  cent  of  all  trials  end  fhror'gh  settle- 
ment after  the  jury  affirms  liability,  another  4 per  cent  are  settled 
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during  the  damage  trial,  and  3 per  cent  at  the  end  of  the  damage 
trial.  Thus,  of  the  (15  + 4 d-  3 + 12  - ) 34  per  cent  of  a’i 
cases  where  the  jury  finds  for  the  plaintiff,  only  19  per  cent  teach 
a trial  on  tire  damage  issue  and  only  12  per  cent,  or  about  one- 
third,  reach  a second  jury  verdict.  In  short,  separation  may  save 
trial  time,  not  only  in  cases  ending  in  defendants’  verdicts  hat  in 
cases  resulting  in  plaintiffs’  verdicts  as  well.  Once  liability  is 
affirmed  these  cases  too  arc  vety  likely  to  he  settled  and,  hence, 
will  benefit  from  separation  of  the  issues. 

These  figures  explain  why  it  would  be  so  difficult  to  predict  at 
the  time  of  separation  which  cases  are  likely  to  benefit  from  it. 
A prediction  that  the  case  is  likely  to  end  for  the  defendant  is  not 
enough,  because  the  cases  that  go  for  the  plaintiff  on  liability  also 
are  likely  to  benefit  from  separation.  To  select  effectively,  the 
judge  would  have  to  be  able  to  identify  in  advance  the  two  groups 
of  cases  that  will  not  benefit  from  separation.  These  arc  (1)  the 
15  per  cent  of  cases  which  go  through  a full  trial  despite  separa- 
tion, and  (2)  those  cases  whose  early  termination,  though  after 
the  liability  trial,  cannot  be  credited  to  separation,  since  they 
would  have  ended  prematurely  through  settlement  or  directed 
verdict  even  if  they  bad  been  tried  regularly.  The  fust  two  figures 
in  the  right-hand  column  of  Table  6,  adding  up  to  (iS  *f  4 ~ ) 22 
per  cent,  are  a measure  of  the  frequency  of  these  cases.  Wc  now 
appreciate  why  Tables  2 and  5 come  out  so  s’milarly.  The  "pur- 
poseful'’ selection  by  the  judge  is,  at  least  with  respect  to  the 
expected  time  saving,  not  too  different  from  a selection  by  tottery. 

V 

Now  we  may  consider  a third  approach  to  the  problem  of  esti- 
mating how  much  time  is  saved  by  the  separation  procedure.  It 
is  clear  that  the  entire  time  saving  conus  from  avoiding  the  liti- 
gation of  damages,  either  in  cases  where  liability  is  rejected,  or  in 
cases  winch  arc  settled  after  liability  is  affirmed.  We  can  then 
toughly  estimate  the  time  saved  if  wc  kno.v  three  item*:  The  pro- 
portion of  trials  that  go  their  full  course  to  verdict  if  separation 
is  not  available,  the  proportion  of  separated  trials  that  go  the  full 
course  to  a second  verdict,  and  the  share  which  litigation  of  dam- 
ages constitutes  of  the  total  litigation  time  in  a full  trial.  We 
know  the  first  percentage  to  be  6:  per  cent;  M from  Table  6 we 

M Aceonilt?  to  Ait*  fwrnhVI  hy  the  A e of  thj  tn:ui 

Coar:*.  ta  the  t%o*yr*t  f prW  to  tV  Htr-.-d  nXl  n fcf  mil  tuk  si, 

6:ft  ti  injury  jsy  iriiU  ti  t jury  \ *.  Td. 
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learned  that  only  j:  per  cent 58  of  the  separated  trials  went  to  a 
second  verdict.  This  means  that  separation  causes  about  (62  — 
i:  -)  50  per  cent  of  the  trials  to  to  nr.  in,’,  le  without  running 
their  full  course.  As  to  the  share  of  total  litigation  time  devoted 
to  the  damage  issue,  we  must  rely  on  expert  estimates,  although 
theoretically  one  could  produce  an  dual  time  count  on  a repre- 
sentative sample  of  trials.  Conversations  with  judges  and  lawyers 
have  led  us  to  conclude  that  on  the  average  the  damage  issue  con- 
stitutes at  the  most  40  per  cent  of  the  total  trial  time.  Armed 
with  these  figures,  we  can  perform  the  following  computation: 
If  each  of  the  roughly  50  per  cent  of  the  cases  in  which  litigation 
of  damages  is  avoided  involves  a saving  of  40  per  cent  of  trial 
time,  then  the  saving  from  separation  would  amount  to  20  jrer 
cent  of  the  total  trial  lime.  This  result,  in  spite  of  the  roughness 
of  its  basis,  is  close  enough  to  our  other  findings  to  permit  now  the 
firm  generalization  that  in  personal  injury  jury  trials  separation 
saves  hid  time  0}  the  magnitude  of  about  so  per  cent . 

Vi 

An  important  limitation  should  now  be  noted  on  the  amount  of 
time  which  courts  may  actually  save  through  the  separation  de- 
vice. All  calculations  so  far  are  based  on  the  assumption  that  all 
cases  would  be  tried  under  tbc  separation  rule.  Hut,  as  fable  i 
indicated,  in  this  court  only  ,37  per  cent  of  the  personal  injury 
trials  are  separated,  and  hence  the  time  saved  thereby  is  not  20 
per  cent  but  only  37  pet  cent  of  ibis  20  per  cent,  that  is  (.,3/  X 
:o  «)  7.4  per  cent  of  the  total  trial  time.  We  also  know,  how- 
ever, ironi  Table  4.  that  this  average  of  37  per  cent  is  not  the 
same  for  all  judges.  We  know  from  conferences  with  Judge  A, 
who  had  S9  per  cent  of  his  trials  separated,  that  he  ordered  sep- 
aration as  a matter  of  routine,  unless  cause  was  shown  as  to  why 
separAl'on  should  not  be  ordered.  Judges  / and  /,  on  the  other 
hand,  apparently  proceeded  with  tegular  trials  unless  a special 
reason  was  advanced  for  ordering  separation.  We  would  con- 
clude, therefore,  that  10  per  cent  is  a good  enough  estimate  of  the 
proportion  of  cases  which,  under  the  most  liberal  application  of 

11  la  3^*  cf  tK  M^srMrJ  *crt  *cltkd  *f?<r  a luTI  I nil.  Wt  do 

t X 'u&  ti h*  \hi  fee  Iml*  wlJcH  hid  fjfl  trxir  fv3  ccurw,  kw* 
tvif,  At<c  Ifct  fir*re  for  tr^%r  iri*U  r.ot  foctaJ*  trxkd  ia  A srlik* 

vm  « <Fmted  \rrdkl  tfta  xht  difc&difcf#  tnd  nMi  10  x\t  fifom 
* tomptralle  rtn cf  co~**  di^stt)*  x\*x  xht  scluifiy 

fjnild  Iriib  u r.ol  t<hvt  wc*wii!y  Ifle  trials  <^V<Wd  t>i  r?/drra  fet 
thijfc.  But  tVJ*  ttt  t is  to  ti  s.r.iH  «rcs{N  to  U i*t**td  ktr. 
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the  separation  rule,  would  be  left  to  the  regular  mode  of  trial. 
And,  consequently,  wc  conclude  that  if  the  separation  rule  were 
applied  liberally  the  overall  time  saving  would  not  teach  20  per 
cent  but  only  (.90  X 20  18  per  cent. 

VII 

With  this  wo  reach  the  end  of  our  efforts  to  measure  the  amount 
of  trial  time  saved  through  separation.  The  remainder  of  this 
essay  is  concerned  with  factors  that  could  conceivably  operate  in 
the  opposite  direction,  increasing  the  court’s  burden  of  trial  lime.’® 
The  most  obvious  one  might  be  that  the  amount  of  time  needed 
for  the  jury’s  deliberation  in  separated  cases  — where  the  jury 
may  deliberate  twice  — may  be  greater  than  in  regular  trials. 
Second,  separation  could  conceivably  reduce  the  number  of  jury 
waivers,  increasing  the  proportion  of  the  more  time-consuming 


1 * Hcfor«  proceeding,  it  L<  necessary  to  rc|X>rt  on  Uo  minor  housekeeping  do 
lath.  One  concur*  iV.<  churl's  u?c  of  medical  cvpcrls;  l he  other,  cvr 

treatment  of  13  ta'cs  In  which  only  tb.‘  damage  i*suc  was  tried  bc^aube  liability 
was  stipulated. 

At  the  time  th„'  separation  rote  was  Introduced  the  tout  I pul  into  efial  another 
Innovation.  The  judges  In  ihrir  discretion  were  enabled  to  appoint  *n  impartial 
medical  ea;ett  If  th*  adversary  experts  were  too  fat  apart.  Since  it  could  be  argued 
that  such  operts  couli  affect  trial  time  and,  hence,  our  analyst?,  we  give  herewith 
the  tck:ar.t  data.  They  carry  the  strong  suggestion  that  these  open *,  whatever 
thrif  effect,  did  rot  distort  our  analysis  of  trial  separation.  Fourteen  per  cent  of 
the  separated  trials,  and  $ per  cent  of  the  regular  trials,  had  such  experts.  The 
average  length  of  thc.te  trials  was  1.$  dns  for  the  separated  ar.d  5.3  for  the  teguht 
trials.  There  Is  thus  no  significant  de\ ration  from  the  general  pattern.  In  any  evtf.f, 
as  we  have  pointed  Out  elsewhere,  the  primary  effect  of  the  impi  thl  expert  Is 
tfkdy  to  be  Ms  increasing  the  likelihood  of  settlement  before  trial,  bit  ay  tn-rt. 

Concerning  the  13  cases  wiih  stipulated  liability,  the  question  had  to  be  answered 
as  to  wbeth'r  thc.se  stipvhtlons  would  have  taken  place  if  the  reparation  ruk  had 
bot  existed.  Defense  tounrl  might  have  simply  pitfmcd  not  to  kt  the  jury  brow 
Just  how  negligent  his  client  bad  b:cn.  For  purposes  of  Our  statistics  ve  treated 
the  cases  as  follows;  If  the  Juige  separated  the  Issues,  vt  a*m?«KJ  that  adn,U.4en 
of  liability  was  In  fesnonse  to  the  separation  order.  Out  cf  the  ij  cases  where 
lability  was  admitted,  3 fell  into  Ihl*  category,  the  remainin*  to  casts  were 
Coanted  among  the  regyhr  trills  *r.dci  the  assumptron  th at  here  liability  would 
have  been  admitted  even  if  the  case  hid  reached  trial  prior  to  the  adoption  of 
the  sepira'iofi  rule.  This  treatment  probably  corsetdes  ks*  bCverce  to  the  sepm* 
tioft  rule  that  It  deserve*,  the  10  cimt*.  treated  as  reguhf  trials  represent  (10  out 
of  it;  »)  $$  of  all  regular  trial*.  Although  w*t  have  no  s’stistiti  ©n  soil  cases 
for  the  lime  ptkr  to  the  rule’s  adoption,  it  seems  wnlilely  — and  the  judges  hive 
confirmed  this  In  their  Conference*  with  w«~~lhat  this  percentage  was  as  Hah 
as  that  We  probably  err,  therefore,  by  crediting  only  3 cues  of  admMel 
lihihiy  to  the  stpua’ba  uk,  This  error  mill  tend  a**  make  the  average  trial 
time  for  tit*  it  shorter,  and  for  the  scp^ited  care  kr.jer,  tha®i  it  idwaTy 
wu;  h txe,  it  pro!  ably  ain  as  to  wr.de rest Sm’e  the  tavirgs  %Ut!SwUhk  to  k^- 
I ratio V 
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trials  before  juries.1*  Third,  separation  might  increase  the  num- 
ber of  hung  juries,  and  hence  the  number  of  cases  that  have  to  be 
tried  twice.11  Fourth,  separation  might  reduce  the  proportion  of 
cases  settled  prior  to  trial,’*  and  thereby  increase  the  number  of 
cases  that  need  to  be  tried.  The  suspicion  that  separation  of  issues 
may  lead  to  an  overall  slackening  of  trial  speed  has  already  been 
laid  to  rest,  especially  by  Table  a,  which  showed  that  the  separated 
trials  that  go  to  two  verdicts  do  not  last  longer  than  full  regu'ir 
trials.  We  turn  now  to  the  investigation  of  the  possible  offsets 
enumerated  above. 

A.  Increased  Time  for  Jury  Deliberation I 

The  first  point  is,  in  any  event,  one  of  relatively  minor  impor- 
tance, since  jury  deliberation  time  is  not  necessarily  court  time. 
Nevertheless,  it  will  be  of  interest  to  see  the  pertinent  figures  in 
the  following  table. 


TABLE  ? 

Average  Jvrv  Deuuerchon  TtME  and  Frequency 


Tlrre  X 

Ter  Ort  c4 
Trtb  It 
Miefc  Jerr 

\W{M«4 

Separated  Trials 

Liability  Ts^ue 

t.j  hours* 

71% 

t-9i  hours 

rhuv.ge  U<ue 

3.4  hours 

u% 

0.4 1 hours 

Total 

2.33  hours 

ilegnlar  Trials 

Both  T*$uc$ 

3 7 hours 

73% 

2.70  hours 

' Th:$  l«  tb 

f.pirf  for  ill  Iriab. 

In  tmb  uhere  th*  jury  actually 

«?oib<ritc<i  IwKf,  it  tool;  fcr.ly  i.i  for  the  Jury  lo  liabiBiy. 


Thus  we*  see  that  the  jury  in  separated  trials,  if  it  deliberates 
twice,  spends  an  average  of  (3.4  + ».*=*)  5.6  hours  in  the  juiy 
room  as  against  an  average  oi  only  3.?  hours  in  regular  trials.  This 
could  be  an  offset,  except  that  in  only  n per  cent  of  the  separated 
trials  the  jury  was  called  upon  to  deliberate  twice. N Tire  last  col- 
umn of  the  table  takes  into  account  the  frequency  of  deliberation, 
giving  an  average  deliberation  time  for  all  cases,  including  those 


Wiillftfifl,  Kcv*;*e  Hijvretlh*  t)  Jury  fritU:  ,4*  EiinfU 

*/  tht  Cu  <7  RtJe  X /«*/«{  fesfv r,  14  \*.w®.  L.  *Uv.  Su 

19  !Usv*  t,.  fcrr.  :$i,  ;$i  (101). 

\V«V«!(!ft,  nfn  ftofe  i;t  ti 
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in  which  the  jury  deliberated  only  once  or  not  at  all.  With  de- 
liberation time  thus  weighted,  the  effect  of  separation  is  favor- 
able rather  than  unfavorable;  the  table  reveals  that  the  jury 
spends  on  the  average  0.37  hours,  or  about  22  minutes,  less  de- 
liberation time  in  the  average  separated  trial. 

II.  Decreasing  Frequency  0 ] Jury  Waivers? 

On  the  second  point,  the  frequency  of  jury  waivers,  the  evi- 
dence is  equally  reassuring.  There  is  no  ground  for  believing 
that  the  proportion  of  demands  for  juries  Increases  under  the 
separation  rule;  in  fact  the  evidence  seems  to  point  the  other  way, 
though  the  difference  is  negligible.  During  the  two-year  period 
before  separation,  juries  were  used  in  90  per  cent  of  all  personal 
injury  cases; ri  during  the  concsp  Hiding  period  after  separation, 


in  only  $6  pc 

r cent.*7  Furthermore,  i 

f separation  w 

ere  to  reduce 

jury  waivers, 

one  would  expect  the  judges  who  are  more  likely 

to  order  separation  to  show  a lower 

incidence  of 

jury  waivers. 

but  Table  S, 

which  presents  the  rclcv; 

int  data,  show 

s no  evidence 

to  this  effect. 

TAllLE  8 

Sr.rAR aiios  Ratio  and  Jt 

RY  Waiver  ts 

Personal  Injury 

TRIMS 

NtTrbtf  H 

1 K f ni  pi  v 

Tre4  VitJci  Rt!e  ti  * 

1 ft  Vfr\ 

N *h  tr 1 

Tr  a’§  t< 

T hr»  1 

A 

ti 

n 

(33) 

B 

Si 

9 

(45) 

C 

4$ 

9 

(n) 

I) 

J6 

16 

(3‘) 
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*9 

0 

(7) 

F 

«? 

»7 

(6) 

G 

M 

»4 

(?) 

Jl 

It 

18 

(*?) 

I 

6 

K o 

(33) 

J 

7 

0 

04) 

K 

0 

0 

(«) 

L 

0 

0 

(») 

• lh<<t  ntk*  (rom  \t\  TMt  4 beau**  Ibey  ire  bifvJ  eft 

il  trials  cof.  jtK'fti  by  r>l  tt,]y  on  big  jury 


C.  Increase  in  Hung  Juries? 

As  to  the  third  possible  offset  against  the  savings  achieved  in 
trial  time,  a possible  increase  in  the  number  of  hung  juries,  they 

*'  Stitiok*  fifplH  V Affti-Biieht  <y=<f  t!  tV<  I'.-iroJ  S:  n*.»  C-nt' • 
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r.,'c  as  rare  an  event  r.ow  r.s  they  were  prior  (o  (he  new  rule.1* 
I’ut  even  without  such  specific  evidence  it  would  seem  unlikely 
that  any  major  offset  muld  be  expected  from  this  source.  Even 
if  the  ordinary  3 per  cent  rate  of  mistrials  had  been  doubled  under 
(lie  new  rule  (which  it  was  not),  the  offsetting  loss  of  trial  (ime 
would  be  only  about  1.5  per  cent,  since  a hung  jury  is  a total  loss 
only  in  about  half  of  all  instances;  in  the  other  half  (he  litigants 
take  the  jury's  hint  and  settle,  so  that  there  is  no  need  for  a retrial. 

D.  Decrease  in  Settlement  Ratio ? 

On  the  fourth  issue,  the  settlement  ratio,  the  evidence  is  given 
in  Table  9,  which  shows  that  the  proportion  of  cases  reaching 
trial  has  not  changed  ?t  all. 


TABLE  9 

SttttxMENi  Ratio  ts*  Jury  Cases  RrroRf.  and  After  Rule  it 


T*c-Y<ir  r#r>4  r<fo*e  T»>-Yui  Ttr'cvl  W:\1b 


Tcf  Cent 

Ttr  CmjI 

t’cfote  Trial 

TiTT 

? t ~ 

“»r 

1X9$ 

Reached  Trial 

»»? 

»5  t't 

216 

»5 

Tola]  JVr*/»ml  Injury  Jury 
Cases  riled 

U3S 

too.ofc 

M33 

lOOO/t 

That  the  separation  rule  is  not  likely  to  affect  the  settlement 
ratio  can  also  be  appreciated  on  a priori  grounds.  The  likelihood 
that  patties  will  settle  is  not  a function  of  the  relative  strengths 
of  their  jrositions,  but  rather  of  the  clarity  with  which  each  secs 
the  other's  position,  since  a settlement  rubes  when  each  party 
can  weigh  the  other's  chances  and  agree  with  him  c*n  what  those 
chances  are.  Whatever  possible  difference  the  rule  could  make 
to  the  verdict  expectation  of  the  litigants,  there  is  no  reason  to 
believe  that  such  a difference  would  bo  more  visible  to  c-ne  side 
than  to  the  other.  Wc  should  expect,  therefore,  that  any  possible 
difference  would  leave  the  litigants  as  likely,  or  as  unlikely,  to 
settle  as  they  had  been  before.*' 
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VIII 

Although  our  inquiry  hos  tnl-.cn  us  far  afield  and  into  several 
side  forays,  the  lash  has  been  a narrow  one  — to  measure  the 
effect  of  separation  of  issues  on  the  court’s  trial  load.  This  then 
is  a summary  of  our  findings: 

Separation  of  issues  will  save,  on  the  average,  about  jo  per 
cent  of  the  time  that  would  be  required  if  these  cases  were  tried 
under  traditional  rules.  This  saving  derives  from  the  fact  that 
in  many  cases  separation  makes  the  litigation  of  damages  un- 
necessary. This  group  includes  all  cases  in  which  liability  is  de- 
nied, but  also  the  majority  of  cases  in  which  liability  is  affirmed, 
because  two  out  of  three  of  these  cases  arc  likely  to  be  settled 
without  trial  of  the  damage  issue.  Thcic  is  no  evidence  that  this 
saving  is  offset  by  a change  in  the  settlement  ratio  prior  to  trial, 
In  the  frequency  of  jury  waivers,  or  in  the  proportion  of  hung 
juries,  any  one  of  which  factors  — if  affected  — could  increase 
the  court’s  (rial  load. 

It  is  not  possible  to  sort  out  effectively  in  advance  the  cases  in 
which  separation  would  prove  futile.  Therefore,  if  a court  wants 
to  realize  the  maximum  of  potential  time  saving  through  separa- 
tion, it  should  separate  as  frequently  as  possible.  The  time  saved 
will  be  in  direct  propoition  to  the  frequency  of  separations.  Such 
a policy  recommends  itself  also  on  the  ground  that  separation  is 
unlikely  ever  to  add  substantial  trial  time  even  if  cases  should  go, 
as  some  do,  to  a second  verdict.  If  a court  see  re  to  follow  Judge 
example  and  order  separation  routinely  unless  cause  is  shown 
to  the  contrary,  it  could  expect  separation  in  about  90  per  cent  of  all 
cases.  On  this  level  of  separation  the  overall  saving  could  be  ex- 
pected to  amount  to  roughly  rS  per  tent  of  the  couit's  trial  lime. 

Viewed  superficially,  the  route  which  look  us  to  this  result 
might  seem  overlong.  Yet  such  is  the  nature  of  the  social  sciences 
that,  short  of  a controlled  experiment,  all  their  evidence  is  imper- 
fect; only  through  a variety  of  converging  approaches  can  one 
hope  to  reach  safe  ground.81  Hal  from  our  investigation  we  have 
learned  that  separation  is  a powerful  remedy  for  court  conges- 
tion. Kully  used,  it  would  be  equivalent  to  increasing  the  number 
of  judges  trying  those  cases  by  one-fifth.  Ca*es  winch,  if  not 
separated,  will  require  five  iudges  for  their  disposition  will,  if 
separated,  require  only  four  judges  and  free  one  for  other  work. 

The  introduction  of  the  separation  rule,  as  we  bad  occasion  to 
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observe,  raised  other  questions  beyond  i I s efficacy  as  a delay  rem- 
edy. These  questions  concern  the  possible  effect  of  separation 
on  the  substance  of  the  jury  verdicts.  By  depriving  the  jury  o* 
Sts  joint  verdict,  it  is  argued,  subtle  influences  operate  to  affect 
the  verdict,  and  should  therefore  be  included  in  an  overall  ap- 
praisal of  the  rule.  We  are  now  in  the  process  of  collecting  data 
which  might  have  a bearing  on  this  problem;  if  findings  prove 
significant,  they  will  appear  in  a subsequent  report. 


QUALITATIVE  ANALYSIS 


The  largest  share  of  our  attention  has  been  devoted  to 
quantitative  data  rather  than  qualitative  data.  Our  emphasis 
has  been  due  more  to  the  complexity  of  quantitative  analysis  than 
to  Judgments  about  the  value  of  qualitative  versus  quantitative 
materials.  In  fact,  we  feel  that  qualitative  material  is  valuable 
to  the  social  sciences  and  that  any  discussion  of  social  science 
methodology  would  be  lacking  if  the  student  were  not  made 
aware  of  its  value. 

hi  general,  qualitative  research  Is  used  in  two  capacities: 
(1)  it  serves  as  a first  step  toward  quantification  by  discovering 
"unexpected  phenomenon  which  stimulate  a search  for  causal 
explanations;"  and  (2)  it  offers  the  researcher  a different  kind  of 
approach,  a "substitute  for  statistical  research  making  the  same 
kind  of  statements  but  on  the  basis  of  recording  and  analysis  . , . 
which  takes  place  largely  within  the  mind  of  the  observer. " The 
following  paper  by  Allen  Barton  and  Paul  l^a/arofeld  is  an  ex- 
cellent statement  of  the  role  of  qualitative  analysis  in  social 
research  and  reveals  its  great  importance,  The  great  bulk  of 
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man's  behavior,  of  course,  is  still  far  from  being  easily  measur- 
able— if,  indeed,  many  of  its  most  significant  aspects  can  ever  be 
measured  in  the  traditional  sense. 

There  is  one  particular  aspect  of  the  following  selection 
which  is  particularly  worth  noting.  Qualitative  data,  like  quantitative 
data,  can  be  systematically  ordered  and  analysed  even  though  it 
does  not  offer  the  same  precision.  As  Barton  and  Lazarsfeld  point 
out,  the  researcher  must  organize  the  raw  observations  into  a 
descriptive  system.  These  descriptive  systems,  as  the  authors 
indicate,  can  range  from  crude  lists  of  types,  each  defined  indi- 
vidually to  fully  systematic  typologies  in  which  each  type  is  a 
logical  compound  of  a small  number  of  basic  attributes.  What  all 
such  typologies  have  in  common,  irrespective  of  their  complexity, 
is  that  from  them  one  can  derive  a classification  of  the  values, 
habits,  and  attitudes  which  are  important  to  the  explanation  of 
social  behavior.  Thus  Louis  Wirth's  typology  of  the  Jewish 
community,  involving  the  social  roles  of  Mensch,  the  Allrightnick, 
the  Schlemihl,  the  Luflmensh  , the  Ycshiba  Bochar  , and  the  Zaddiok, 
give  us  as  complete  an  index  as  any  obtainable  of  the  culture  traits 
and  the  culture  pattern  of  the  group. 
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From  Allen  H.  Barton  and  Paul  F.  Lazarsfeld,  "Some 
Functions  of  Qualitative  Analysis  in  Social  Research," 
Frankfurte  r B e i t r a g e Z u r S oz  i ol ogle 
(Band  1,  1955),  pp,  321-361.  (Reprinted  from  Bobbs- 
Merrill  Reprint  Series  in  the  Social  Sciences,  #S-336, ) 
Reprinted  by  permission. 
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1 he  advancement  of  research  procedure  in  toeinl  science  as  elsewhere  depend?', 
on  mailing  explicit  wlm  researchers  actually  do,  ami  systematically 
a naly i n g it  in  the  light  or  logic  and  of  substantive  knowledge*  Such  a "codi- 
fication” of  procedures  points  out  clangers,  indicates  neglected  possibilities,  and 
suggests  improvements.  Ic  makes  possible  t!ic  gcirerafr/atioa  cf  methodological 
knowledge  - its  transfer  from  one  specific  piojert  or  subLei  matter  to  other?, 
from  one  researcher  to  the  community.  finally  it  rmdus  possible  a more 

systematic  training  of  sce.le-r.i,  in  place  of  s’mply  expo >iug  them  to  concr.tc 
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fact  hive  been  del up  ^ Ah  qualitative  material.  About  ICO  studies  were  culled 
for  characteristic  examples.  A:1  a v.s : then  made  to  organize  these  ca-e>  in 
order  that  the  most  charuc. eristic  types  of  qualitative  work  could  be  distin- 
guhlud  unJ  documented.  This  piper  presents  the  resulting  organization. 

The  reader  will  have  no  difficulty  in  noting  that  this  “guide  through 
qualitative  research”  is  ::ce!r  guided  by  proceeding  from  simple  to  over  more 
complex  piwco  jre>.  Vc  begin  with  a discussion  of  the  value  of  simple  obser- 
vations. Vic  then  proceed  to  those  studies  which  center  on  ordering  and  classi- 
fication. Our  next  group  of  example*  demonstrates  the  various  ways  in  which 
several  variables  arc  interrelated  through  qualitative  analysis.  Next  we  discuss 
cases  where  the  analyst  wants  to  encompass  such  a great  number  of  dimensions 
that  he  cannot  make  them  ill  explicit,  but  tries  to  sum  them  up  in  a general 
“pattendk  This  is  probably  the  point  a:  which  qualitative  research  is  most 
creative,  most  controversial,  and  most  difficult  to  describe*  It  will  be  seen  that 
we  had  to  use  a spec’.1.!  term  (“matrix  foimula”),  to.bring  into  relief  tins  means 
of  seeing  the  social  world  in  a new  way.  Finally,  we  .ouch  on  the  role  of  quali- 
tative data  in  the  support  of  theory,  a topic  so  large  that  we  did  not  dare  to 
pu  rs u e i t : o a r.  y e x : e n t . 

It  should  also  be  kept  in  mind  what  this  paper  does  no:  attempt  to  do.  } u> t 
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Wc  have  also  not  dealt  with  the  problem  of  evidence.  Under  whu  conditions 
in  the  social  science:  an  as- :rtio:i  is  proved  i > a very  divheult  question,  not 
restricted  to  q nalitative  research.  I;  seemed  best  not  to  touch  on  this  issue,  in 
a context  in  which  all  emphasis  was  placed  on  providing  a picture  of  a kind 
of  work  which  is  usually  considered  so  “private”  that  it  defies  all  systematic 
presentation. 

I,  ANALYSIS  Of  SINGLE  OBSERVATIONS 

When  one  examines  qualitative  reports,  one  of  the  fir^t  types  of  material 
which  catches  our  attention  is  the  “surprising  ohservarion’M.iltethenets  of  deep- 
sea  explorers,  qualitative  studies  may  pull  up  unexpected  and  striking  things 
for  us  to  gaze  on,  Wc  find  that  there  are  people  who  believe  that  they  arc  being 
educated  by  the  unrelated  and  trivial  information  presented  h/  quiz  shows'. 
Interview’s  with  people  deprived  of  their  newspaper  by  a strike  disclose  that 
some  do  not  turn  to  alternative  sources  of  news,  but  to  reading  anything  which 
is  lying  around  the  house;  a major  function  of  newspaper  reading  seems  to  be 
simply  to  fill  in  “gaps’*  in  the  daily  routine'1.  Observers  of  the  under  world  tell 
us  that  professional  thieves  constitute  a rather  exclusive  social  group,  with 
exacting  standards  or  membership  strongly  reminiscent  of  those  of  lawful  pro- 
fessions'*. Anthropological  data  of  course  are  full  of  surprising  observations! 
that  Fskimos  lend  their  wives  to  guc.us  * dthout  any  jealousy,  that  Fiji  Islanders 
kill  their  chiefs  when  they  grow  old,  and  so  on4. 

These  phenomena  are  of  various  levels:  some  are  individual  beliefs  and  be- 
haviors, some  are  a matter  of  group  standards  and  structures  within  a society, 
some  involve  the  norms  of  a whole  culture.  In  each  ewe  the  qualitative  research- 
er lias  simply  disclosed  that  such-and-such  a phenomenon  exists.  And  in  one 
way  or  another,  to  be  told  that  such  things  exist  has  a strung  impact  on  the 
reader.  They  all  have  an  element  of  surprise. 

In  the  next  few  pages  we  will  try  to  clarify  what  this  impact  is,  and  what 
research  functions  are  served  by  these  qualitative  observation*  which  simply 
state  the  existence  of  something  surprising.  We  can  distinguish  at  least  two 
different  uses  for  such  observations,  l ies:,  the  cx’stence  of  a phenomenon  may 
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raise  problems  - me.:  js,  compel  us  to  look  tor  explanations,  to  explore  its 

consequences,  to  try  to  fi;  it  into  our  <Ji.iue  of  knowledge.  Second,  we  irny  had 
in  tlie  Liu a.lfiulve  observation-;  an  hidicuor  of  s . * . . ; c general  variable  which  v e 
want  to  study,  bv.t  cannot  me  mure  directly. 

IV  /ck/;  Ahme  Problems 

Some  observations  -are  surprising  because  they  con  filet  with  our  expectations 
cither  common -sense  or  t!ieo:etice.!!y  derived.  Other  observations  surprise  us 
by  bringing  to  light  phenomena  which  are  simply  new  and  unexplained,  win;1; 
challenge  our.  curiosity.  Yet  another  important  type  of  problem-raising  ob- 
servations is  that  winch  brings  together  under  a clear  le.be!  a body  of  “familiar” 
experiences  which  had  not  previously  been  seen  as  a definite,  generally  occur- 
ring social  phenomenon  - which  forms  for  the  first  time,  so  to  speak,  a “social 
object”  to  be  studied. 

In  any  of  these  cases  die  result  is  that  a pro!  !cni  is  raised.  Our  attention  is 
focussed  o:i  a phenomenon,  and  v-.v  are  stimulated  to  see!;  explanations  and 
inquire  into  consequences.  ’Jo  make  such  a problematic  observation  is  to  initiate 
a research  process  which  may  lead  to  significant  advancement  of  our  under- 
standing o‘  social  phenomena.  (Some  kind  of  observation'-  no  doubt  raise  more 
significant  questions  and  lead  to  more  valuable  hi  dings  than  others,  li  would 
be  of  great  value  to  develop  “screening  principled’  which  can  direct  our  atten- 
tion to  the  more  significant  of  the  surprising  observation;;  this  caimot  however 
be  gone  into  here1.) 

To  give  concrete  meaning  10  the  nod  an  of  a problematic  qualitative  obser- 
vation, and  u>  provide  materia!  for  its  further  development,  a number  of  cx- 
ample*  can  be  gi ecu. 

A>  Is  well  known,  the  original  experiments  oi  the  NYcstcrn  Id.ectric  re?eitxhers 
led  to  highly  surpiising  quantitative  results:  the  ey.periir.i uta!  change*  it:  phy 
si  cal  conditio:;.*  of  work  in  no  way  accounted  tor  due  changes  in  production  in 
the  experimental  group  or  workers.  Ac  tins  point  the  researchers  de  cided  to 
go  b;ck  to  the  Nay  first  stage  of  the  r search  p recess,  read,  simply  gather  ob 
servarions  about  v.  ha:  goes  on  in  a no. null  working  group  in  a f actory.  ’Ih’S 
exploratory  rx search  turned  up  a number  of  surprising  quabtati*. e ob-erva* 
tionsk 

Some  v.*o;!l  groups  v are  char.urerii v d b>  ‘‘,i  lads  of  ambition  and  initiative  end  a 

cnmyV. dedr:  i«.»  !.•;  % el!  cnccgr,  .done”: 

t A dbev '?* Vi  t.’f  ?Yl;  p ; av-.t;  k 'C.*.r.  ,n  5 d '*~i  ;.rq  r.rd  Varied  S'f.  ::e*e, 

Gkn.  e.  I’.'i:  T 4*:  Ch.wv:  liiPi!.:  I7ii::i,;rC  i ; .t.-uxe  . r.  Sve: ; .0.;*.'  d i 

1 i J.  ’go.tHob.'r;  r :uj  Yihb.n  j.  I-Y,;or.,  M ir  ig;ew.:  ^ ^-k;i 'Ce.' u ’g, 
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" 1 he  supervisory  control  vh;ch  is  see  up  by  man '.genunt  to  regulate  and  gowr.i  the 
workers  exercises  little  Authority  ...” 

‘'Jbey  (the  Ci  jployee>)  finely  believe  that  they  vi!I  no:  be  satisfactorily  remu- 
nerated for  any  additional  *.v  orU  they  produce  over  the  bogey  . . 

All  of  these  frets  were  in  conflict  with  v/lt-u  the  management  ard  the  researchers 
had  expected.  By  following  them  up  the  researchers  were  led  to  their  now  well- 
known  discovery  of  the  importance  of  informal  groups  in  foimal  organiza- 
tions, and  of  the  harriers  to  convnur.ication  which  exist  between  levels  vrkhin 
organizations, 

Communications  research  offers  many  examples  of  surprising  qualitative 
observations.  \Yc  have  already  mentioned  the  discovery  of  unexpected  motives 
- for  listening  to  quiz  programs  and  for  reading  newspapers.  Unexpected 
responses  to  communications  are  another  important  example.  A broadcast 
vzarning  the  public  to  patronize  X-ray  operators  and  avoid  “quacks”  left 
some  listeners  afraid  of  any  X-ray  treatments,  and  others  doubting  whether 
there  could  be  any  X-ray  machines  in  the  hands  of  incompetent  operators1. 
A film  designed  to  impress  Americans  with  the  British  v/ar-eflort  left  some  move 
convinced  than  ever  that  America  was  bearing  all  the  burden*.  The  discovery 
of  such  anomalous  responses  led  to  more  detailed  investigations  of  the  com- 
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deed"  in  a P^paganda-wary  society  - in  this  case,  Kate  Smith's  presumed 
strain  and  sacrifice  in  n.a.k'iig  the  IShour  h:oadcast. 

The  c>  rumples  so  far  h ive  involved  uuc'-vveted  plv.T.onv.na  which  stimulated 
a search,  for  causal  explanations.  Jn  other  instances  the  problm*  which  i>  raised 
is  in  the  other  diicction  - an  investigation  of  the  consequences  of  a certain 
phenomenon  is  stimulated.  Thu*  one  researcher  interested  in  problems  of  the 
profession  noted  an  "obvious"  fact  a*  raisin.1;  a problem:  while  all  professionals 
meet  a certain  proportion  of  failuics,  the  trial  lawyers  as  a group  necessarily 
lose  half  their  cases1.  What  must  be  the  consequences  of  such  a high  rate  of 
failure  for  these  professionals  and  how  do  they  eh:  d with  it?  The  answers  to 
these  problems  might  throw  light  on  some  important  problems  of  the  pro- 
fessional role.  The  ability  to  take  a commonplace  fact  and  see  It  a*  raising 
problems  is  important  because  it  can  lead  ultimately  to  such  enlightenment. 

Another  such  observation  was  made  by  Merton  in  his  study  of  the  Kate 
Smith  wat-bond  marathon.  In  the  content  of  the  broadcast  there  was  no  re* 
fcrcnce  to  the  real  economic  purpose  of  war  bond  buying  as  an  anti-inflation 
measure.  Merton  saw  this  fact  as  raising  problems  of  consequences:  an  oppor- 
tunity to  educate  a large  and  attentive  audience  in  economic  realities  had  been 
neglected,  and  reliance  had  been  placed  instead  upon  "large  delusive  state- 
ments" playing  upon  the  anxieties  of  those  with  loved  ernes  over.-eas.  What 
were  "the  further,  more  remote  but  not  necessarily  less  significant  effects  of 
these  techniques  upon  the  individual  personality  and  the  society?”  "Does  the 
unelaboratcd  appeal  to  sentiment  which  displaces  the  information  pertinent  ;r 
assessing  this  sentiment  blunt  the  critical  capacities  of  the  listener*?"-' 

An  example  of  the  forming  of  comm*' mpb.ee  experiences  into  a clear!*/ 
labelled  social  phenomenon,  and  thereby  creating  a new  object  for  investiga- 
tion, i>  Adler's  formula* ion  o:  the  concept  ot  the  infciioricv  complex*.  livery- 
one  at  one  time  or  another  has  experienced  feelings  of  being  inadequate, 
unworthy,  etc.,  but  until  the>c  private  sensation*  had  been  pointed  out  and 
labelled,  they  could  not  be  investigated  by  social  science.  Other  examples 
which  might  be  called  to  mind  are  Veblerfs  formulation  of  the  concept  of  con- 
spicuous cor.MUiiptienk  or  Sutherland's  1 -.belling  of  cert. on  categories  of  bu>:- 
hess  behavior  as  "white collar  crime*'5.  Without  having  actually  dbeov  ered  any 
new  facts,  simply  by  directing  attention  to  familiar  facts  placed  for  the  first 

1 v*.\_uu*r  Thi.'jns,  r,*<o  ;,f  i * ColemU.i  l Vdvcr^y,  V:?u::  ,-r;  of 

J Nurmn,  M.vi  PcriyisY-n.  !.  e.,  pp.  v.-g. 

1 A Mu*  AiYr.  Th-  N^r.mc  (V.*tb;:,a  N ■ v Y-r-.  PO*. 

1 T;i-:r i-i-'i  r Yv’-Yn,  *l : : Ta.vry  o;  rY;  U\>s:  C Urn  V.jc.rn  ViVn;>.  N. Yo;k 

1 i Jwi : ii  Ce’iY.i'J,  Y !;;c-Co!;ir  C:;;iur,.  i!: . . /.r.'.-rY-.n  d mvYv, 

Vo],  V,  lYureJj  ) 1 > ’ t , \ . . 1,  *v>.  t <c].;  - Wrc  CY.Y r ( re:.-,  NU".  Yvrk 
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time  within  a distht  rtive  category,1  these  investigators  v'Cr-r  .'hie  to  raise  im 
portant  p rot' Ictus  and  initiate  fnutttil  study. 

The  reader  may  have  noticed  that  some  of  the  examples  of  "surprising  re- 
servation*” cited  are  no  surprise  at  all  to  him.  This  is  as  it  should  be.  1 ne  point 
is  that  one  time  they  were  surprising,  and  initiated  tuttner  in'  estimation 
has  been  sufficiently  successful  to  render  them  familiar  and  understandable 
todav.  Another  problem  is  tliat  an  observation  nay  be  surprising  to  the  par- 
ticular researchers,  while  other  social  scientists  have  known  about  it  all  along. 
Thus  the  Western  Electric  researchers  made  the  “surprising  discovery”  that 
informal  social  organization  existed  among  workers.  Ocher  sociologists  naJ 
long  been  aware  of  the  problem  of  informal  groups;  however,  the  special  pre- 
conceptions of  American  industrial  sociology  up  to  that  time  had  kept  it 
unaware  of  this  whole  realm  of  phenomena.  In  a still  more  extreme  case,  a 
finding  may  be  new  only  to  one  particular  researcher;  m winch  case  it  might 
better  be  termed  “self-education  than  a scientific  discovers. 

Is  there  anything  which  a researcher  can  do  toward  making  ‘‘surprising 
observations”  other  than  to  maintain  an  alert  state  of  mind?  It  may  seem 
contradictory  to  speak  of  giving  instructions  for  making  surprising  observa- 
tions. However,  there  are  ways  in  winch  one  can.  r.t  least  increase  tne  pro 
ability  of  making  such  observations.  Some  of  these  are  discussed  by  Jahoda. 
Dcvtrch,  and  Cook  under  the  heading,  “The  Analysis  of  Tnsight-Snnnilatmg 
Cases’.”1  Strangers  or  newcomers  to  a community  or  a country,  it  is  suggested, 
may  be  able  to'pkk  out  problematic  facts  which  are  simply  taken  for  granted 
by  those  accustomed  to  the  locale.  Marginal  individuals,  or  case.  :n  .raruit'em 
from  one  stage  or  status  to  another,  may  present  much  more  clearly  certain 
problematic  features  of  a personality-type  of  social  sy.-tem.  Deviants,  extreme 
cases,  or  “pure,  ideal-typic.il"  cases  may  have  a relatively  high  ett  ciency  in 
indicating  problc.nr.tic  frets. 

Oiserv.Wc?:*  X('!:ich  Sc^ic  Iruhc.itors 

The  lust  tvpe  of  surprising  observations  discussed  were  those  whicn  v-ere 
anomalous  and  unexplained,  which  served  the  function  of  stimulating  a searca 
for  explanations.  Another  type  of  qualitative  onjcrvatiorts  are  d: alleging 
because  we  see  in  them  indications  of  some  la  tgs -scale  ph.-.-n.  a e .e  . no.  v - 
cannot  perceive  directly.  Thus  the  occurrence  of  rot;  and  protest  meet. r.gs  in 
the  Nci'th  define  the  Civil  War  serves  as  r.r.  indication  tin 
coin’s  v.v-  ’•  Of  shell  rr  rottc-rv  found 
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routes  oi  trade  And  eulrural  contact  in  the  prehistoric  woild;  i peculiar  mili- 
tary custom  indicates  the  caste-like  nature  of  army  organization;  modes  of 
speech  may  indicate  complex  mental  patterns  or  cultural  emphases. 

Throe  situations  can  he  pointed  out  in  which  one  pays  particular  attention 
to  qualitative  indicators.  They  are  distinguished  in  terms  of  the  kind  of  ob- 
stacle which  prevents  direct  observation  and  measurement  of  the  underlying 
variable: 

(1)  Situations  in  which  qualitative  evidence  substitutes  for  otherwise  simple 
statistical  information  relating  to  past  ages  or  inaccessible  countries. 

(2)  Situations  in  which  qualitative  evidence  is  used  to  get  at  psychological  data 
which  are  repressed  or  not  easily  articulated  - attitudes,  motives,  assumptions, 
frames  of  reference,  etc. 

(3)  Situations  in  which  simple  qualitative  observations  are  used  as  indicators  of  the 
functioning  of  complex  social  structures  and  organizations,  which  are  difficult 
to  subject  to  direct  observation. 

The  underlying  assumption  in  all  these  cases  is  that  a phenomenon  which 
cannot  be  directly  observed  will  nevertheless  leave  traces  which,  properly  inter- 
preted, permit  the  phenomenon  to  be  identified  and  studied.  A great  historical 
movement,  a basic  personality  characteristic,  an  essentia!  characteristic  of  or- 
ganizational structure,  should  all  leave  their  imprint  on  a!mo;r  any  docu- 
mentary material,  accounts  by  observers,  or  even  physical  refuse,  which  they 
leave  behind. 

Examples  of  the  first  class  (qualitative  substitutes  tor  unavailable  statistical 
or  descriptive  material)  would  include  the  use  of  newspaper  stories  or  other 
contemporary  records  of  public  demonstrations  as  indications  of  public  opinion 
in  past  times  of  crisis1;  Frazer's  use  of  advertisements  for  slaves  in  ante-bellum 
Southern  newspapers  to  find  out  about  the  suucture  of  the  slave  family*;  the 
use  of  reports  of  refugees,  Soviet  press  materials,  and  the  contents  of  Soviet 
literature  to  provide  information  about  life  in  the  Soviet  Union1;  the  use  of 
archeological  remains  to  indicate  culture  contacts  or  religious  beliefs  in  pre- 
historic times. 

Examples  of  the  second  class  (qualitative  indicators  of  psychological  vari- 
ables) include  forma!  projective  testing,  the  psychological  analysis  of  personal 

1 Per  i nyr,b;r  of  cwvrphs,  see  Ta'j!  1:.  l.«ar*fe!d#  The  Obligator.;  of  tli?  1 95 C Polhtc? 
rathe  1 9> 4 Hhr>rki:t,  in:  Pub!:  - Opirlxi  Qu.:rt£rJy,  Vo!.  14,  No.  4 (Wjr.rjr  3 530-51). 

J } . \:.  I raA:;,  Th«?  N*:;m  I -tmiv  :r*  the  UrtrteJ  States,  CECum  1^39,  r>;».  55  *c.;.  This 

arv.l  r;  dE:-i*s:J  Its:  J?cb:r:  To Training  GTd;  on  t'\;  Q-idmvbc  Vs:  of 

D'cunur.tary  N;u  Verk,  Ihtr-ej  of  A;**!!:,!  Sods!  K-.ar.h.  H3C; 

. 5 l or  Jr-  Sc  i;:  Polnijs:  th:  I31C\:vli  o:  ?ow-*r,  C.'Av,'i;i2$t\ 

P5C;  A?e*:aniicr  G:r»  !i:nT%r  n.  A NV'  * ^ $ ',vr,v  of  Keener iv  Ir.;f.r;,*n:i-\n  on  Sovjft 

in:  An:?fic5n  ?r.  I 1C  >;  l.  *ror:2n  Rtvlj.v,  Vol.  VA  (Fcb.isry  1553% 
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documents  or  artistic  works,  the  amilvsis  of  items  o:  liter  attire  or  entertainment 
as  presumed  projections  of  the  traits  of  their  audience**,  arid  of  course  the 
analysis  of  qualitative  interviews  or  records  of  participant  observation.  A good 
example  of  the  uie  of  *-  heritors  for  a psychological  concept  is  found  in  the 
study  of  anti-prejudice  cartoons  mentioned  earlier: 

“In  tracing  the  process  through  which  these  68  respondents  arrived  at  their 
misunderstanding,  we  find  our  starting  point  in  the  fact  that  mas:  of  them  identi- 
fied with  Mr.  Bigou.  (Footnote:)  By  ‘identification*  we  mean  the  mental  process 
through  which  a subject  assumes  the  role  of  another  person  to  such  an  extent  that 
actions,  either  wb.il  or  behavioral,  directed  toward  the  object  of  identification 
are  experienced  as  directed  toward  the  identifying  person.  I vidence  of  identifica- 
tion with  Mr.  Bigntt  w:i,  mimfesred  by  the  subject's  acting  in  one  or  more  of  the 
following  ways:  (a)  explicitly  affirming  identification,  saying,  for  example,  'I  guess 
Tm  a Mr,  Bigoct';  (b)  consistently  and  openly  sympathising  with  Mr.  bigou,  ex- 
pressing sorrow,  for  example,  thu  Mr.  Bigou  looked  so  weak  and  sick  in  the 
'transfusion'  cartoon;  (c)  interpreting  a threat  to  or  criticism  of  Mr,  Bigou  as 
referring  to  himself,  as,  for  example,  becoming  emotionally  upse:  by  ike  cobweb 
on  Mr.  Blgou's  fue d."2 

A single  conversation  reported  by  the  authors  of  Deep  South  beat  s witness 
to  the  depth  of  feeling  involved  in  white  attitudes  toward  Negroes  in  this 
dcepl)'  prejudiced  area:  A social  worker  described  a poor-white  family  in  which 
two  girls 

“'are  having  babies  and  are  not  married  . . . That  isn't  the  end  by  any  means  . . . 
Somebody  told  me  that  this  older  girl  was  sleeping  with  the  father...*  After  the 
recitation  of  the  case,  when  the  social  worker  was  out  of  the  room,  a woman 
whispered  to  the  interviewer:  ‘Mr*.  Wilson  says  those  gifts  h. tve  Negro  n:n  too, 
but  Miss  Trent  (the  supervisor)  won’t  let  r.ic  say  anything  about  that . . . Isn't  ih.it 
azL'ful?1  It  is  significant  to  rote  from  this  interview  that  the  incest  situation  was 
viewed  with  les  horror  than  the  infraction  of  the  caue  sex  taboos." 3 


On  the  bads  of  bis  long  participant  observation,  William  Whyte  was  able 
to  report  the  following  striking  indicator  of  the  complete  acceptance  of  gam- 
bling in  C*orr.ervi!!e: 

"When  a mother  scads  her  small  child  down  to  the  corner  for  a battle  of  milk, 
she  tells  her  to  put  the  change  c:i  ^ number."' 


Investigators  studying  the  effect*  c*  invernplovnve: 
people  of  an  Austrian  village  bad  the  school  children 


:t  c:i  the  psychology  of  the 
v'l'itv  essays  on  the  theme, 


1 For  e:,r  exa.r.p!  *:  I.TridJ  V.  W:Gr:  r n if  J !-'.*r  V:e  ; G.r-vn  ar.i  Yr *;:< 

KctV.:t«d  b.  Pop  j!ar  IT'.*.:  i,  Kuv.n  RT  n;r  Vol  J,  N*>.  4 i.Vrgu;:  1 /.  pp.  Sire. 

5 Kca.kdl-Volu  1.  c..  p 16>. 

* Allison  Dj»*:s,  BvlTah  B.  G:r  a'  JM:y  R.G*:!  .w\  IT/O  5o«;:h.  CMruo  30-i.Tp.  116. 

* Yilli.U'i  F.  V.  C .r.  m S*i:-»:ty.  CSi:  !'■  1 3,  1 1t. 
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“My  Future  Oceupa tlo::”.  The  pervasiveness  of  the  insecurity  of  the  children 
of  the  unemployed,  it?  c,vrod'.  e e:tecc  on  planning  for  the  future,  was  indicated 
anion;  other  thing*  by  the  * cry  language  used.  Children  of  employed  workers 
would  Vv  rite,  “I  will  he  . . or  “I  want  to  become  . , Children  of  the  un- 
employed tended  to  use  phrases  like  ‘ I might  become"  or  “I  would  like  to 
be  . in  the  same  «mdy  a small  boy  remarked  to  one  of  the  investigators 
that  he  would  like  to  be  an  Indian  chief,  “But  I am  afraid  it  will  be  hard  to 
get  the  job.”  * 

The  third  situation  - the  use  of  simple  qualitative  indicators  to  show  the 
attributes  of  complex  social  structures  - is  very  clearly  exemplified  in  Blurnen- 
dial's  study  of  a small  mining  community,  The  speed  and  inclusiveness  of 
interpersonal  corwiuinicn:ion  in  the  community  was  indicated 

“by  the  fact  that  should  a death  occur  at  nine  o’clock  in  the  moriung  and  the 
information  no:  read:  a resident  until  late  in  the  anernoon,  hi*  usual  expression  is, 
*1  earth  understand  why  I didn’t  hear  that  sooner’,  and  others  say  to  him,  ‘Where 
have  you  been?  Everybody  knew  that  by  noon.”2 

At  another  point  Blumcnthal  net's  the  existence  of  conflicting  qualitative 
indicators  of  the  social  contact  between  Minevilh:  and  its  nearest  neighbor,  and 
concludes  that  one  has  the  greater  weight; 

“Daring  the  heyday  of  Crystal  her  people  and  those  of  Mmeville  were  not  so 
well  acquainted  as  might  be  supposed  . . . Hotly  contested  baseball  games  end  the 
conmuMiitie s having  celebrated  together  on  the  taaiti  day  of  festivities  for  each  - 
Miner’s  Union  Day  - were  not  indications  of  far-re;  king  personal  relation;.  This 
is  shown  by  the  measure  of  social  distance  evidenced  by  th;  fact  th  at  a young  man 
venose  reputation  v,-.\s  such  in  one  town  that  ir>  ‘respectable*  girls  refused  to 
associate  with  him  could  go  to  the  other  and  fraterniee  wit!  its  ‘be 5:’  voting 
woman.” 1 

1 he  existence  of  primary  group  relations  within  smaller  emits  of  the  Ameri- 
can army  can  be  inferred  from  the  following  qua’hative  indications  drawn 
from  an  interview: 
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nonon  of  me  *\\Ttvy  caste  s y**cu\\  pointed  t < * an  Dualized  syr.uojic 

act: 

“iVAtcd  teat  sdcctt:/.  fur  office:  cacidida*:  % \.c  d vero  hr>;  J\Jur;:d  fieri  tne 
Army  and  ri\*.d:a‘;u\i  in  tuelr  su*v.*  Am*  very  different  s:.*ns 11 

J i! as  it  is  »m;  MisiMc  in  move  fiom  one  c.v>u  to  rmaho*  in  an  ethnic  c vc 
situation,  so  an  c:dA:cd  man  aSo;:i  to  become  an  officer  nur  t leave  the  Army 
system  l*:foro  reentering  in  hi>  r-cw  status.  The  continuation  ol  ims  custom  h 
a certain  indication  v{  the  continuation  of  the  attitudes  o4l  r.  “caste  system'* 
in  the  Army'. 

In  disrobing  the  family  structure  found  in  Middletown's  various  classes,  the 
I.ynJ*  suggest  a p indicator  of  the  position  of  the  husoind  in  the  fatv.iliy: 

‘ h rr.Av  rot  I*.'  wholly  fantastic  la  surmise  t^:t  thac  to1;  he  «nme  sijTilScanco 
fa  tl  undir: t ;fwV.j  of  loed  maiit?!  anoei  .tion  n the  h erard  / of  term*  hy 
vhiJi  local  votiiCn  speak  of  their  hu/.vnJ*.  *lh:re  i:  a definite  accent  of  r«i..n  in  hii 
conjeg.il  relations  35  one  g^cs  vp  i-i  the  social  ;c de,  from  hay  old  run*  tl  roav.li 
'the  ni5u\  ’he'  (ru.C>t  frequent),  *the  fnirc«\  *JoS *«\  *my  busbar  •!’,  to  ‘Mr.  J.svA 
H e fust  four  ere  the  con. men  tarns  arcor.j  t'e  working  ch*«?  f.vr.ilic',  th:  ?A»t 

t v.  o amCi  g husinC'S  v!:.Sj  families/** 

T lie  Indicators  which  have  been  referred  id  rue  of  many  different  form*. 
Some  are  linguistic,  some  arc  symbolic  re  a,  se.ve  ere  due  muntery,  some  arc 
phjsica!  ebjeet*.  As  mbiiAntAe  knov  I.djc  of  bn;;ubt:c$,  t dal  cigtroxulor, 
and  teJnuolo.jy  are  ;pp!ieJ  to  the  p:oh!:i  % one  I r.  \y  expect  tver  more  Sensitive 
aid  r J ; . hie  interpret  ti-ons  of  vjcii  cpvlit  vdve  Indicators.  To  what  ext.-.  t inter* 
pritAMun  of  indicators  will  have  to  reman  r.n  rrt,  And  to  achat  extent  it  can 
he  made  a s;ien.\\  h one  of  the  impart?;::  pivbl::n$  of  qralti'uivc  rcscardn 
whidi  we  cannot  attempt  to  c!fcu>>  here. 


II  CON51  RUCTION  Ol  DhSCniPTlVr  5VSTTMS 

The  previous  seedon  discussed  wh.a;  can  he  de:ic  with  A single  "poi  :**  of 
<pjA?ii  * : h c chat*;  the  pra*;-t  <rcf:o:i  cerdJers  \ hat  One  evs  v lan  omfre  rad 
hy  A v at* -y  of  qualitative  eb-co r.t*on^  As  a ftrn  *;:p  to-ua  j iv;  ha* 
stavdiog  a fad  of  her-an  :.e;i\i:yt  vW.  os.'? r/zo  the  re;*  eh  avc.tN-? 
ins  j r da  rip  :ive  s>  te  n.  lu  . are  traes  c . has  o le  to  r;  ; */  c :Ugo:*e§  i!:ca?v 
sc:  up  ! y p;c  vhv:;  j - ./g  * ;ri  or  r * the  sreUty  half,  v * pa  :e  ! •*  ; 

further  ov.a  of  A?.Ovsl>.  In  otlnr  can  pa  *iN.  / wxo:**'"  c-’n^a  ere 
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And  in  sonic  cases  the  loe.v.dver  inu..t  crciu  !)i>  own  clasiifie  uion  system  for 
the  material  under  study.  It  is  this  latt-.r  case  which  will  he  pardculady  con- 
sidered here. 

In  terms  of  their  formal  structure,  the  descriptive  systems  created  by  in- 
vestigators can  range  from  crude  lists  of  "types",  cadi  defined  individually 
without  clear  logical  relationship  to  the  others  to  fully  systematic  typologies  in 
which  etch  type  is  a logical  compound  of  a small  rumbe**  of  basic  attributes. 
Between  these  end  points  ere  all  intermediate  degrees  of  "partial"  sysiern 
Atization,  including  some  sets  of  types  which  include  in  tluir  definition  virtu- 
ally nil  the  logical  elements  necc^ary  to  set  up  a inulii-dimcnsional  "ait.ibute 
space",  Imt  in  which  the  logical  ;.;»:dysli  has  not  been  explicitly  made.  Ins- 
criptive systems  may  also  vat y in  lenm  of  their  degree  of  cor.  .Terence  or 
generality.  A fully  $>  stemati*  typ.dogy  may  he  based  on  dinwruionso4  a highly 
limited,  concrete  nature,  while  a pi  limhutry  classification  can  be  hron  I and 
general-. 

ftrclinu’i.ny 

A classification  wl,\h  fall*  toward  the  vr,<)  stentati'Cvl  end  of  th<*  cniutuum 
can  be  called  a preliminary  one,  vrCe  it  represe -its  an  csseriti*!  first  step  toward 
the  ideal  of  a fully  systematic  cne.  *1 K:  importance  of  ihi.  (aa  step  from 
completely  unordeted  J.u*t  to  ? preliminary  vhvdhccmon  must  ‘sever  l*o  under- 
estimated.  Until  die  data  ,vc  ordered  in  sornevv.y,  ti  e ei  dydsof  re!  vionship; 

rh't  rormdly  ihvtb*;*  out  of  fhc  atO  mpt  to 


n preliminary  C1 


.ere  b An  interactin': 


analysis  of  relationship 


cannot  legindniorr  refined  era 
di:.il)/c  relutonsHps  he: 

process  between  refinement  of  classinC  j;I.  a anu  tia 
A good  p Himhnry  c!  tsdfuathm  1 usr  piovide  \ workable  summary  tf  the 
wealth  of  eh  % en»s  in  the  Origin.*;!  data,  ai  d include  - even  if  in  unyvster  > ;t'c 
form  - the  basic  element*  rcecoaty  for  underst*  nhng  the  situ  non.  \ \ aJ 
prcliminmy  classif cation  is  er  e which  is  peer  in  c\?  tents  and  saygestixeness. 
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is  Louis  VM rtli's  si'.^jjcsrive  rc‘.;s  on  “so:,!?  Types  of  I\  i s o: i ’.! 

diTm.'s  bis  purpose  in  usinj  t!»;j  as  t jl’ov.i: 


y”.  Wirt’s 


"Ihe  sociologist,  5r.  t:  nnsf rung  the  unk;o: 
represemtiw  or  typhal  one,  arrives  at  tl^o  sc:'." 
attitudes  on  the  p tre  of  the  person  toward  himself 
in.*  set  of  attitudes  of  tin*  group  toward  him  . . . TJ 


or  individual  c <;.v!cii:c  into  .1 
I type,  which  cor? 5: st s of  a s:t  of 
and  the  $/vup  and  s correspond- 
l.e  range  of  pemndhy  t>  pes  in  a 


given  serial  group  is  indicative  of  the  culluic  of  that  group."1 


Wiitl/s  gallery  of  ‘'characteristic  and  picuircs^j:  personalities 
with  in  lire  average  com:minit)  M ir, eludes: 


tint  uiC  met 


the  Mtnttk,  a person  of  s jperiot  economic  status  v ho  his  "achieved  Ire  success 
without  sacrificing  lis  identity  as  e.  f:vv"; 

the  ?r:cvt  who  “in  Ids  opp  uir.ism,  h\‘  thrown  overboard  rues:  of  the 

cultural  htgg  tge  of  hU  c<v*;p“; 

the  5 ev/er *!.'  /,  who  belies  the  stereotype  of  the  Jew  as  “the  persordfc tticn  of  the 
commercial  spirit*'  by  being  “qehe  s!};rt!»‘*>  ana  helpless,  falling  i.dser.d'iy  it 
everv  thing  he  undertakes"; 

the  /.'/IWnic-e,  v,  l.o  move;  easily  from  one  ur.mceo  fid  project  to  another,  ;.r,J 
whose  "only  apparent  incer.*  of  sulsistcr.ee  is  t!'c  air  he  breathes"; 
the  }YfiV'.t  /* 0 ; r,  literally  the  uh'uJksl  student,  die  >ou-g  r ,an  v,ln:  learning 
pvt;  precipe  itfcpectivc  of  wealth  or  origin; 

the  y.jJtiik  t th*' pious,  pitiiarJnl  person...  ^ho<£  exemplify  conJecr  i*  peo'.cd 
to  m an  example";  ar,J  10  on*. 

1 lie  ptnp^c  of  presenting  these  types,  drawn  r cJy  from  the  Mhlav  and 
liter Alu re  of  the  subject  group,  lies  in  the  /act  that 


“ib:)  arc  as  co  nplcte  ar.  index  as  any  at  p c, err  obta'mWc  of  the  culture  tr; its . • • 
am!  the  col;  re  pattern  of  the  etc-jp..,  Together  :Vy  tomtintr  the  personal 
ns  ;hi  around  v him  the  fstaJe  of  thn  cuhvrecf  the  giorp  15  voviti.  A detailed 
analysis  of  the  croch!  personality  e>  in  any  given  or  cultural  gioup  shows 
that  they  d.prju!  upon  a set  of  baS>;  v d attitvd.s  In  the  group  for  their  existence 
and  are  tl  e direct  expressions  of  the  \ \,\  e?  cf  th:  group,”4 


Stuting  f rc*;;i  the?e  typ:>,  therefore,  cn  o*r«  derive  a clA^>if;catiun  of  the 
\Cthss:,  habits  anJ  auiuk\s  vnlch  arc  impciur;  to  the  c>p!a;>e.tl of  t !sC 
behavior  cf  the  group. 

hi  much  the  same  way,  from  folklore  and  lhorcture  as  v-wH  as  fetiA.vl  oS* 
servet  or,  C Vhiphi  ?%!ilh  draws  a gOIciy  of  **v. J.iovccl'a;  type;.”  T' here  are 
t > pcs  of  riA'  'gvts; 


tbc“Llvm 
to  »;  iy  i 1 


r ;.,i“  or.  tec  rep  c !*  t; 
line  \ I:S  ihe  i:n,«  of 
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the  ‘ <Al  vc l : j-jit  bdu’v  i !: j t ^p,  wh>  sed;  security  in  closely  fo!!ov  ing 
er;  ".It  in^m. lions,  and  strive  for  inference  from  those  below; 
the  !.ve  vi.tf”,  th:  yoervrr  i.nn  on  his  vay  up; 

the  "rev  on  step  re  nee  r‘\  vi  o proper*  .is  a fixer  and  go-between  iii  a world  of  huge 
and  CO:  J urgani/fitkr  s mass  tr. a:ss;  hition,  and  gtrcral  insecurity1. 

There  nrc  typ*$  uf  intellectuals5,  of  academic  men1,  amt  - nil  the  vtly  down 
at  the  bottom  of  the  white-collar  pyramid  - of  salesgirls:  Hil;e  wolf**,  "the  char- 
mer", "the  ingenue",  "the  social  pretender*1,  and  so  on4. 

"1  here  is  a serious  purpose  in  pin-pointing  tlK>e  pieiuriMjv*  types: 

"Lh  cxairir/13  vhht^coliar  life,  it  is  possible  to  learn  sonv  thing  ;b.n:t  what  is 
becoming  more  typically  *A *;:er icu fi*  than  the  frontier  character  probably  ever 
was.  V.  he  t in  Jit  be  graced  is  the  picture  cf  Society  as  a great  r.aKvioom,  an 
enormous  file,  an  in%\ rporaui*  brain,  a res.*  universe  cf  mamgenvnt  a nd  ina  -ipnla* 
ticn.  By  vnd.vj  tending  theic  diverse  whit  collar  v/«er!J>,  one  can  also  vno*..*tanJ 
better  the  shape  and  meaning  of  modern  jccicty  as  * hole,  as  v. ell  as  the  simple 
hope;  and  complex  anxieties  th:t  grip  all  the  people  who  are  sweating  it  out  in 
li  e middle  of  the  twentieth  century.'** 

The  pc ncrc.!  run  cf  preliminary  cater.  Aries  will  not  l?  as  colorful  find  rich  in 
suggestions  as  these  post  quoted,  but  they  will  be  of  the  $.m;e  formal  nature:  a 
Simple  list  of  disci  etc  "types**,  i luis  * *c  v ;:1  have  li  *ts  of  "types"  of  comic-beck 
feeders*,  types  of  chent-proictslcpal  relations,  tjjns  of  Appeals  iu  r certain 
propaganda  boude^th  typ^s  of  conr.vu.kics,  etc.,  representing  a preliminary 
ordering  of  rrmd.vir!  into  a simple  \\  i of  headings  As  the  analysts  prey*?*?, 
citl  et  v.  hi  in  the  origin  tl  study  or  in  the  work  of  replic  uioa  or  Secondary  i:;a- 
1)  si*,  these  simple  1 «t*  rn  y be  de\  doped  into  more  systematic  ar.d  more  general 
descppdvc  systems. 

Somcv.nat  further  along  the  read  to  generality  find  syitcmiZAtion  are  tlic 
hme  s of  "type?"  found  in  the  greet  dcvl  of  the  speculative  and  theoretical 
liter;  Ur  re.  lyptcal  examples  -urc  arc  SprangcrV  six  "value  types'*  - the 
theoretic  economic,  aesthetic,  Social,  pditic.*!,  end  tcbgbud  * or  von  \1e;cT 
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ur  t ypes  of  rv!iplou>  organi/ itunis  - tlie  ceclesu,  tin-  wrtotnir  ition,  tPe 
and  the  cvili*.  Merton,  in  dhcusn n the  for;  ^ ot  mverp,rior.al  intluence,  ?is;s 
the  following  type;:  coercion,  Jof.iin;.t:on,  ma.ibpiil atlom  clxrhh\uav,  pro- 
vision  of  prototype  for  initnicn.  ar.d  cu'nt^o,  Kingsley  Davis 

classifies  socn!  mums  in  u idiriom!  CAtcgoihu  folkways,  rner  s,  l.v.v,  cuironi. 
monhty,  leligion,  contention,  and  fadiionh  h.rswv!!  sots  up  eight  bade 
cat egories  of  values  which  he  iocs  to  classify  institutions  and  leader*';  Mili* 
non-jki  sets  up  seven  “basic  jie'dd’  in  terms  of  widen  cultural  phonon'cni  cat 
be  c I reified*  ; and  so  on. 

All  of  the  above  rr.cniruneJ  sets  of  categories  arc  of  far  greater  generality 
than  loose  v hull  nrbe  in  tire  analysis  o?  \ single  CiT'piJ  k*  al  stud)  of  limited 
scojv.  They  are  the  result  of  aitempo  it  genera!  analysis  of  a vide  range  of 
situations.  Oa  the  other  hand,  In  their  foruut  aspect,  they  are  simitar  m the 
ether  forms  of  pwlhuiirAry  categories  discussed  earlier.  Some  of  them  are  quite 
un system Mucd;  others  include  In  their  iLl.ruion  mot:  id  thcvVencnts  reepured 
to  5:i  up  a log’cal  structure  of  basic  attributes  from  vhkh  they  could  he 
derived,  but  this  ba<  not  been  explicitly  dor.\ 

A special  kind  of  descriptive  system  tvlrch  rrdgh.  he  mentioned  node:  this 
heading  consists  of  ordered  categories  which  are  set  up  :>  d-cvciOprne  tad  sieges 
or  degree  along  a continuum.  Ti  t s Vu\;*\  d;sti  igoished  the  of  dece- 
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The  iao>i  ii?  y !;■*  d-.-vcloj  u-  form  of  descriptive  cir  velddi  cr.n  arise  ir.  a 
q;nlit.tivc  an;«  y>i$  is  on.  in  v hrch  cedi  typ*  i«  explicitly  derived  from  the 
logical  combination  of  basic  attributes  cr  diinen$:o:n.  A simple  example  i>  the 
logic,1!  scheme  set  up  by  Kiesnen  in  hi>  study  of  political  pAiiitijaLionk  by 
examining  a set  c :*  concrete  'type  c «>cdJ  Ric»:inn  v.\i$  led  to  break  the  concept 
of  participation  into  two  l rule  element*;  emotional  involvement  and  compe- 
tence, m r.  ore  simplj,  “eating*'  and  "knovmg”.  Taking  ccch  of  these  elements 
as  a simple  dichotomy,  Isi;«:n.v<  obtained  four  types  o!  relations  to  politics: 
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A T>po!ogy  of  1'  Jin’c  Prejudice  ar,il  Dt<crinutin;!on 
ATTITUDE  DIMENSION' 


Non  r!Isc rin'vif: * tson 
lit  IIAVJOft  DIMENSION 

t)is  .rlxlnrtoi 


N on  ■ JTCJ  L'  vis  w J IVs  i uv  see  J 


1 Tv  po  1 : 

T>tc  III: 

„Thc  Vl  V crVct  Ii'l  :oV 

„TK‘  r.*i:A*’iJvt'cr  IJ'iS.’raV’ 

Tvr.H: 

Tvpj  IV: 

1 ,,'l  1 c 1 j’r-Vc v.l  it  1 1'.  vf 

,/IK-  A’bVVat  ,r  iM'i-rd" 

A mere  list  t»f  tlic  "folk-labels''  of  cr.A  type  would  Appear  snf-c ifc?all>*  like 
one  of  the  preliminary  list)  of  categories;  ikey  arc*  fundamentally  diiierent* 
however,  r’ncc  they  ate  system  aiicnlly  derived  from  the  cross-tabulation  of 
tvu  basic  dimensions, 

^1‘lic  mo  t elaborate  me  of  systematic  typologies  h found  in  la.lcoti  ikwsond 
recent  works1.  Parsons  sets  forvvai  J fis^diekotoMOus  attiibute?: 


1.  Aftceiivity  - Affective  ruu  it/; 

2 . Self-orient. u;ori  * C\>I!«t!vky  orient  .uion; 

>.  Urtv<z*»!i»m  - l~a;ticu!aris;.i; 

4 Ascription  - Ad  i'*  en.et 

5.  Specific!*/  - r.ess. 

J ; y coml  hung  these  r\  e"p  u urn  v anabbd*  i\;riO;n  has  been  able  to  const ruvt 
general  rict  for  de:ctibing  social  ivlavoa*,  Col  total  system?,  iuJ  per- 

sonality system-. 

*liu*  process  of  constaeting  systematic  typologies  need  only  he  biiefly 
summarized  here.  The  starting  point  is  often  a good  prclimmaiy  f.t  of  ere* 
gorier  tty  examining  diem  one  derives  a small  number  of  r.rri^vt.s  v.huh 
seem  t > pictide  die  bads  for  die  distinction  made,  and  set*  d.rse  e.Urib  ne; 
up  as  a nndddimcr,<iun;l  system  (an  #,.ittribirte-sp;ce”).  Yhi«  ope. -alien  Ins 
been  tcir  id  *1  c ' suburb  dord'  of  a i attribute  Space  la  : t>;ob.v  \ O:/  can 
then  e xa-nin ' dl  of  d’e  1>  g’cally  } visible  <-  mbinx turns  of  de  hwd*  atttibews. 
litis  serves  to  locate  toe  cngoi?l  set  ot  cat  agones  % dibit*  lac  syst*  t it  oben 
shows  d m ? r:e  cn\-b!  t *;icns  i.  nc  l ::v  re  aecj  (apg.  t as  blank  ccsb;,  v*!,uc 
in  etK  r Kv:>  drst»r  ct;m*?  have  lev*  Tinsse  ■ c : . j i ■ a t c*t-;g»vy  v*»i  cveilap 
severed  cells;.  Of  ewirw  tvat  cl:  the  k-eb  dre  pm*  b’;  v '•  b -’at;*' ns  map  1 * 
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Analy;! f.  Sub  a recombination  has  been  terr.-xl  a ,,rcil  ict'onM>  and  is  closely 
related  to  the  c j cratton  of  index  fi's.nMion1. ' 

A.iiril  Subtractions 

'Jiiere  remains  to  be  mentioned  ?.  type  of  operation  which  is  \c;y  frequent 
in  qualitative  analyses:  the  pait'al  systematization  of  a concept  or  a set  of 
catcgoiics.  A good  introduction  t * this  operation  is  the  well-known  dbcussiun 
by  S:m:ncl  of  envy  end  jealousy*,  Uc  situations  in  uhich  these  feelings  arise 
are  quite  complex,  and  Slmmcl  does  no:  ^ive  an  exhaustive  account  of  them. 
V/liat  \w  does,  however,  is  to  indicate  one  important  .vpect  it:  which  the  two 
attitudes  differ:  in  the  case  of  jealousy  the  pet-'cn  feels  that  he  ba>  a claim  on 
the  cbjhct  of  his  d sire,  v hilc  in  the  case  of  envy  he  1: n 5 iu  claim,  only  dedre 
for  the  object.  Siscmcl  has  thus  partially  yabitivaed  the  Attributc-spice  by 
which  envy  And  jevlou*/  could  be  systematically  defined;  he  has  no;  done  so 
c<  mp!;:cly,  but  rather  only  enough  to  make  one  major  distinction. 

A more  elaborate  but  s:t!J  partial  substruction  is  presented  in  Winer  1 ami- 
t Acds  discussion  of  “Tj  p.s  of  Integration  arrl  Yhcb  Measurement”.  L indecket 
bey, ins  by  Ldicatirj  his  discoti'eM  v/ith  the  undiffeicntiatcd  concept  of ''social 
Integra  lio*”.  lb  study  the  relation  c f i .u/trath  n to  other  variables  Jo  find  iu 
nrcer  a \y  con  dido::'  and  stsconse-y1  me  cq  the  broad  must  be  broken 
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and  skills  in  Its  projection  [5]  'I here  is  no  split  of  v.\*rk  and  p!:y.  or  work  and 
culture.  '6]  T 1' e c.vm:.:  Ws  vay  of  live!  hood  determines  mJ  infui.i  his  entire 
mode  oi  1: vir^r.*"1 

In  clTcct,  Mills  proposes  six  attribute*  by  which  a job  situation  can  bo 
describe;!.  If  all  six  of  these  attribute*  have  the  valves  indicated  above,  vc 
have  the  i-dethtype  situation  of  "cra^nunship’’.  Use  situation  of  the  modem 
industrial  orefltce  worker,  Mills  implies*  is  the  opposite  of  tlu*  idealized  erarts* 
rmrt  in  all  these  respects.  Actually  the  six  attributes  gi\c  64  logically  possible 
combinations  of  values:  the  intermediate,  mixed  combinations  however  Jo  not 
enter  into  Mills'  present  discussion,  which  deals  only  with  the  idealtype  cases 
and  not  with  the  whole  attribute-space. 

In  tins  ca<e  each  of  the  six  attributes  ectueHy  refers  to  a relationship  - be- 
tween a worker's  capacities  and  his  work,  between  work  and  leisure,  etc.  *lhcy 
can  be  dc.ivcd  from  a relational  Matrix  v^nshtln*  of  four  e ten  cats:  the  worker 
(his  capacities,  his  character/;  the  v. o»\  activity;  the  tinal  product;  and  the 
worker's  leisure  activities  (his  ‘'play",  "culture",  "general  mode  of  living”). 

of  thc.:e  can  act  on  .Any  of  the  ethers,  as  svnvuuised  in  the  relational 
scheme  b Jov: 
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puked  out  an  .1  pul  ir.t } a t *r n-\  wh  the  >:  rate  gw  ec;  if  to  "fed  out"  i ho  =e 

which  are  irnpmunc,  which  will  iihimztclj  help  ;<►  solve  the  problems  in  vhidi 
we  .ire  interested.  Forma!  analysis  can  tN.cn  b:  used  to  clarify,  develop,  and 
cormnunie.itc  the  results  of  qualitative  Inbghts. 


111.  QfAUTATIYI.  DATA  SUGG  J *TI  NO.  K 1 I ATI  ON  > I i\ V S 


The  only  fully  adequate  way  to  test  the  existence  of  a relationship  between 
two  variables  is  through  smUtic.il  analysis;  to  test  cauw*  and*  effect  relations 
rcqui.es  cither  a controlled  experiment,  or  a rather  large  numlwr  of  case*  of 
"natural  change"  observed  over  time.  But  regard;  which  Jns  neither  statistical 
weight  nor  oxj  e invented  design,  research  h.veJ  only  on  qualitative  descrip dons 
of  a small  number  of  cases,  c *n  ncr;:tirc!e>*  play  the  import  nt  role  cf  suggest* 
mg  possible  relationship,  causes,  effects.  and  even  dynamic  processes.  Imbed, 
it  can  be  argued  that  only  reseaid;  which  provide*  a wealth  of  ird?c  elhueous, 
unplanned  impulsions  and  obse.  vrt  ons  cm  play  this  role.  Those  who  uy  to 
pet  suggestions  for  possible  explanatory  factors  for  statistical  tc suits  s:!e!y 
from  looking  at  tabulations  of  the  fee  vort.udes  which  vc;edelibc;ai?3y  included 
in  the  study  in  advance  often  tan  in  a sc  no  progress;  son.: ‘time*  ever,  t sirJe 
w*ritter.dn  convi  cm  by  a respondent  v il!  proviJj  ft  clue  to  additional  factors. 

/ i'idbig  “Arc  ton"  / A:UO'i 

A clasdc  case  of  the  vise  of  qualitative  observation  to  disclose  possible  factors 
influencing  behavior  is  the  Vest  err.  Ulcctric  study  h Vbrri  the  expe;  imer.tal 
ftonp  of  workers  mAnmirscd  their  high  prod union  even  when  physical  con* 
ditions  nre  wide  vor-c  t Inn  before  the  e.  primers:  began,  it  v;s  clear  that 
something  cL;  was  cfleetirig  their  pri-ducthvi.  V ia;  th*  real  factors  were  vis 
first  suggested  by  informal  conversation?  with  and  ob;er*a‘iom  of  the  experb 
rccc.u!  group,  and  fio.n  then  cm  the  min  tvSeardi  clTcrt  vv.?  focussed  on 
qualitative  inter viewing  and  cb.*c;%ft;!e  i ta  di scovia  wc'a!  fa.:< 
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"the  albcay  sari.*  appci:*  emerged  a<  a dramatic  event  ...  a jingle  unifvJ 
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n i altruism  phtved  ar.  imegr  al 
'the  Audience V images  cf  Smith*  the  chs* 
structure  of  our  society.  the  cultural  *:o.nJ.m?s  of  distinct  strata  of  the  po- 
pulation. :»::J  socially  induced  expectation*.  feelings.  tensions  were  all  - 
cately  involved  in  the  pattern*  of  re.-ponse  to  the  bond  drive  *;  “bu:  the 
cumulation  of  altect  ar»J  emotion  va*  r.ot  the  major  function  of  the  marathon 
broaJc.wts.  Above  all,  toe  psevarneJ  >tavs;  a«.d  strain  of  the  ei^htcendiciir  series 
of  broadcasts  served  to  validate  Smith's  slr.cewt/  . . . for  on  understanding  of 
the  process  of  p^-jado',  the  mo^t  significant  feature  of  the  e responses  to  '.he 
marathon  is  the  etTvvtivv*  . s of  this  propaganda  of  the  d\\l  amoni;  the  very 
people  ^ho  were  distrustful  and  skeptical  of  mere  exh  Ktttion/* 

A study  applying  the  fame  l-rdmi^ue  t > a broad.;  hhtoried  situation  va* 
that  of  l.'!?aheth  / e f : : r cn  the  factors  in  recent  history  v.h!ch  In'/iuenccd 
Attitude?  toward  Jtvvs  in  France*#  By  a vcr>  s.u t!l  wonder  of  detailed  inter* 
views  with  people  who  were  p, earned  to  he  goad  observer*  {.thm  t In’f 
concieigci  of  Apartment  hour'*.  the  other  half  inwhertwd^.  it  was  suggested 
that  these  a ere  four  sivm  tr<?*uenc::.g  Attica!-:*  toward  J;o<  !rt  one 

% 1 > i>%*  \ -*  .v 


rt 


or 


•r.o;h;r:  the  jer?e.at:o;,<  cutdde  1 * r a ? . . * a 


hvi  : \toe  pao, 


aware  of  the  Jew  5 a.s  a spccJ.il  croup;  ;h.  appear  a nee  of  Jewish  refugees  front 
other  eouritriv5-  ?.  ho  were  a clearly  visible. xlttTeKnt  group  in  Ftvpvh  society;  the 
persecution  of  x)  : FanJi  V*->  dudm;  the  1 ecup  tr'an.  which  aioiocd  ccitain 
ihelng-  of  g;/;*  and  a certain  teal  d tn :-e;  f w the  c who  help*.!  the  j 0 • * > : the 
restoration  of  Jewish  joes  a n J property  aider  the  Liberation,  which  obviously 
cavfed  b;.  an  l disturb.::  cc  to  * v,v  1 o:.*J,  w?. 

One  spcci d tc.hn;^ae  far  discovering  add  ti-  r I ! 

v it  at  c.v e * ^ ! : . > 
cto.t.  Yivjf  a par 
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noodle”  hum;1.!:!,,  Tj  expw.n  the>e  crocs,  qualitative  mwrvivvo  wcr c used.  It 

appeared  that  w\h  incurs  were  involved  as  the  depree  of  security  in  one’s 
Attitude •,  tlio  fcclon;  that  o i/s  he! r ^ v eto  socially  earned  a:ul  not  a personal 
responsibility,  t ho  Shot  that  tiro  subject  totally  diside:ui‘!ed  himself  from  the 
carlo  a tu  rod  t: u re  of 


Mr.  Bhrott”  and  therefore  were  under  wo  threat1. 


Q.-iwlit.tti:  e Si  of  Pro^ c>> 

Tlie  simplest  fi  rm  of  a “process”  analysis  is  that  wliiJt  Inihs  Jor  an  inter- 
venirq;  variable  v%  h ivli  “explains”  the  cor: elation  between  two  other  variables. 
In  his  study  of  an  Mast-coast  slum  neighborhood,  William  Whyte  ai rived 
(on  a qualitative  bud?)  at  the  followiia;  relation-dap:  the  socially  aspiring 
“cohere  boys"  clubs  seemed  to  be  More  unstable  ar.d  subject:  to  internal  con  ■ 
diet  than  tho<c  of  the*  non  e-mobile  “corner boyd\  ')■>  explain  tins  relationship 
(which  could  be  considered  quite  “updde-dov.  ;b*  iron  a ni:ddle-el,v>s  view- 
point) Whyte  introduced  a third  variable,  “informal  ote.ani?.uio:db  i iic  cor- 
ner b ay  clubs  could  draw  for  cohesion  on  already  existing  informal  organi- 
zation: 


"‘The  diily 
in  eir.be  is  an  I 


activities  of  the  comer  hoys  dct-.  ra  dried  the  rchttive  portion*  oi 
llocaiwd  reuvndl  : litres  a :•  J obliji  ■/.ions  vlddn  the  i;rovp.  ’ 


Anuiry  the  college  boy?  c:i  the  other  bend, 

“Outside  club  v.mv-i-*  ti  e :ue::/v:s  -Idem  uweduted  mg:  the*  e,:;<p:  ir.  pair. 
Since  t ii : ; e v- a>  no  Iuf< wu.u  org -.id/adon  to  ! :r.u  th  * men  t ’f;:dur,  there  was  nbo 
no  cvrrr.on  uud;  nrundirig  upon  tr.  t:rer>  of  authority,  responsibility,  and  ouh- 
c;  •don.”2 

lo  be  able  to  a^c  t this  explanation  vddi  any  core  ai r it y.  *:  would  be  necessary 
for  W!  \ to  to  have  observed  corner  boy  clubs  which  \»ere  v eah  in  informal 
o:v;e’d;  ado;:  and  college  boy  -roups  v Inch  v.v.c  strung  in  it;  if  due  former 
•Acre  a!:o  un-tub!  \ veiule  the  latter  v\:e  stable,  ir  would  constitute  a certain 
test  o:  too  rypot; w ?:*.  o ny.e  does  no:  record  vnemer  m*  iouc.i.  Otu  stiui  uv. 
situe  f or  v.  as  i.-nme*  vitun  range  oj  sum  ca>e-. 

I ; : toe  same  ay  toe  re!at:?:i  ^ctwcei  m onnersiup  n;  a come:  gang  -mu 
■ I ?!  e to  rj  -c  vcoii  thy  v as  c X p j » 3 c u m ' ,ur  > o:  :/c  u/patt  croup 
r<d  itio.-'S  on  sav  !::p  end  speivd!::.;  Itibit Wiiyte  d.rdcs  that  t .vc;..u  u;  o'.ierea- 
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“ I He  pattern  ol  social  rnchiiity  in  CuiKTeille  can  L est  be  understood  when  it  is 
contrasted  with  the  patter;1  of  corr.jr-bov  activity.  On:  of  the  most  important 
divergences  .irises  in  nutters  involving  the  expenditure  of  money.  The  colLge  boys 
fit  in  v. j tli  an  economy  of  snvl:  gs  ami  investment.  The  corner  boys  fit  in  v.ith  a 
spending  economy.  The  college  bey  nnm  save  his  money  in  order  to  finance  his 
education  and  launch  Ins  busbies-  or  professional  career.  Me  therefore  cultivates  the 
middle-class  virtue  of  thrifh  Jn  order  to  participate  in  group  activities,  the  corner 
boy  must  share  his  money  with  others.  Ff  he  has  money  and  his  frier  J does  nor,  h 
is  expected  to  do  the  spending  for  both  of  them  . . . Prestige  and  influence  depen 
in  part  upon  free  spending.” 1 

This  observation  indicates  some  of  the  factors  in  the  process  of  social  mobility. 
Of  course  behind  each  such  factor  uncovered  are  other  factors  - the  variables 
which  for  instance  determine  who  s:hh>  with  the  boys  and  spends,  and  who 
breaks  away,  saves  money,  and  rises. 

Jhe  ui i covering  of  possible  processes  can  go  much  further  than  inserting 
a third  variable  in  a chain.  The  study  of  anti-prejudice  cartoons  mentioned 
previously  suggested  a whole  chant  which  led  up  to  nnsundc; standing  among 
the  non -deviant  two  thirds  of  the  prejudiced  people  who  misunderstood: 

1.  Identification  with  Mr.  Bigott  and  momentary  understanding; 

2.  Desire  for  escape  from  idtmc’iieation; 


Mr 


m.teucctu \\ 


3.  Dhidentificat'Oii  median -sm  (caricaturing  Mr.  Ifigott,  nuKn.g 
or  social I)  inferior); 

A.  Derailment  of  under  standing:  absorb:  ion  in  die  derogatory  diarae:erTt:cs  of 
Mr.  lfigott  to  the  exclusion  cf  understanding  the  point  of  thr  cartoons*. 

In  the  Whyte  study  one  find-  the  process  of  the  ri«e  of  a local  cornet  hov  to 


Since  he  has  only  limited 
favors*  he  must  ' 


cymc-ni  to  war  . I p: 
to  cause  constant  turnave:.  II.; 
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of  equilibrium,  presumably  a:  a level  determined  by  the  political  abilities 
initial  “connections”,  and  good  luck  of  the  individual  politician. 

In  exploring  for  possible  factors  affecting  soma  given  variable,  or  for  chains 
of  causes  ar.d  effects  constituting  a “process”,  there  appear  to  be  two  basic 
techniques.  The  first  attempts  to  obtain  objective  information  about  the  se- 
quence'of  events,  particularly  what  events  preceded  the  response  under  In- 
vestigation1. The  typical  questions,  whether  addressed  to  a subject  or  used  by 
an  observer  to  guide  Ids  observations,  are:  “What  happened  before  X?  What 
happened  just  before  the  subject  made  I. is  decision  to  move,  vote,  buy,  steal, 
etc. /What  was  the  frame  of  mind?  What  had  been  going  on  in  the  family,  neigh- 
borhood, nation,  world?  Had  he  been  talking  with  anyone,  reading  anything, 
listening  to  anything?”  Some  responses  will  look  like  causal  factors  imme- 
diately, on  the  basis  of  our  past  experience  or  genera!  hypotheses  about  human 
behavior,  Others  v#  ill  only  become  prominent  when  we  notice  an  apparent 
correlation  between  them  and  the  criterion  behavior  in  severed  cases. 

The  second  technique  is  to  ask  people  themselves  to  explain  what  happened 
and  to  give  their  reasons  for  acting  a>  tlv/y  did’.  The  basic  question  hero  is 
always  “Why?”  This  technique  lus  obvious  limitations:  people  are  often 
unaware  of  the  real  motives,  of  indiicct  influences,  of  the  precise  chain  of  causes 


and  effect?,  of  underlying  nece>:*ary  conditions.  On  the  other  hand  it  stands  10 
reason  that  the  participant  knows  a good  deal  a boor  his  own  behavior,  partic- 
ularly about  attitudes,  motives,  influence-,  “trigger  events”,  and  so  on,  and 
often  can  tell  the  outdde  investigator  about  things  which  he  v.ould  nevei  have 
guesTfd  by  himself.  “Hensons”  may  not  be  the  whole  story,  but  they  arc  an 
impOiianc  f.nuce  of  iiiforiuarion  on  po.-sible  factors,  and  in  some  cases  a quite 
indlspeiislhle  source,  especially  in  the  c.wly  stages  of  investigation.  1“ y adding 
to  the  general  “why”  quciy  a set  of  more  specific  question?,  focussing  the 
respondent’s  attention  on  each  of  several  bade  aspects  of  the  situation,  reason 
questions  c:r,  obt/.in  more  adequate  coverage,  although  still  limited  to  v/hnt  tlie 
responder.:  hhirelr  is  in  a position  to  brow. 

both  of  these  tewurqm:*  arc  combined  m a technique  of  qualitative  ex- 

p!or.u*au  o:  causa!  rdnt’ons  k:  o m a;  “discerning”.  This  has  b::n  carefully 

....  ♦!,, 

.■as  I* at 


iksrrlbed  in  ?.Iirra  Komarovsky’s  study  of  the  efK::  of 
familv  status  of  the  ha-band1,  'b  ah  cadv  V.  c wc-nv 
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possible  to  unr.  rtako  a fuHmcale  rmdslcal  analysis  of  the  intern!  itions  be- 
tween all  the  pordblc  variables  Yi'ha^vas  done  was  to  take  each  came  of  apparent 
change  due  to  die  hnsbandb  lines  gloymeut  a id  subject  it  to  systematic  checkin 
Mad  the  change  already  begun  before  the  iirvernploymcnt?  Did  oilier  racicrs 
arise  concurrently  with  unemployment  which  might  have  been  the  teat  c/au-e? 
Are  the  participants  aide  to  trace  the  stcp-by-stcp  development  of  the  change, 
the  detailed  links  between  unemployment  and  the  altered  role  ol  the  liu^bari^!? 
If  the  respondents  believe  that  unemployment  v-.is  the  ion-on  fur  a certain 
change,  on  what  evidence  do  they  base  their  opinion?  Ly  these  techniques  ic  was 
possible  to  isolate  with  considerable  prom's:  of  validity  the  causal  relations 
betveen  unemployment  and  family  structure.  The  search  for  “p.):dble  factor/' 
and  “possible  comequenee/’  was  made  systematic;  within  the  limitations  of  th  ‘ 
data  teal  precautions  were  taken  against  spurious  relations. 


Q.ws  i-Sutiistics 

1 > r c v i o u *j  sect  ion  s ! i a w J e a 1 1 w i t h op . .at!  o : t > o f q u a 1 i tat  i v can a ! y ? ! s w!  j i ch  a r e 
essentially  prior  to  quantitative  research:  observations  which  rabe  problems, 
the  for  mulct  Ion  of  descriptive  ca:w;'vie%  the  uncovering  or  possible  caused 
factors  or  emirs  o!  causation  for  a partianar  plcsv  of  bvbc.vior.  Ihcse  oper- 
ation* sthmil.ue  am.!  locus  later  quantitative  research,  a:  a they  sm  up  the 
dimes  do  ns  and  c negon*..-  along  rhj  “scab"  o:  the  mohs mao vhn.h  yuan  :t.?tr.c 
iC'-i r.rek  may  id!  the  actual  frequmcic*  at  d mov rer  vents 

Ho.cever,  cut : encounters  wiy  f a..- ; a e • ml;-  in  social  science  literature  studies 
vc  j j*  m d'  not  use  :**.:•  mccnanism  or  quantitative  J u.w. unction  c.vj  statist  ice  l 
a red;.  Is  and  mill  make  the  kind  of  statements  vhhh  cuantkadve  re-caich 


m 


nakm.  Thew  statements  may  be  /mph  fit  jvmcy  dbLiibutieiij  'i.  e.  “rams: 
’Jrob. lender/*  or  ; wn  1 the  Trobti  a.uier”  brow?  c r beik  vm  ur  dem  -erd-s  : 
they  to  ay  be  coru’.mmm  (comm  bey,-  have  a vending  economy,  ‘vk.i1:  cob  me 
bey?  haw  a hu*  economy);  ihey  may  be  ^ co. .v-;tj  o:  causal  :el  ttmmb'p; 
(“if  [the  p vibchan]  eo:  ceutrates  upon  serving  oar::  grimy.  be  v ii!  never 
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as  a contmuiuj  end  meieisingly  rclmcd  j“-ur sviit  by  in.’  whole  community 

Of  SOchl!  Self LltistS. 

An  example  of  the  ihivyr»  of  impretdntd ;ti ; "quasUstatisihV  is  given  by 

JUvnard  Ihulwv  in  an  article  on  pat cicipn.uun  in  voluntary  associations: 

"Amorim an  observer)  t'wnr:lv» ' were  over* .■helmed  by  v, hu  they  did  not  fuhy 
undcri-tanJ : ino.uice  tie  folloalng  from  Charks  ar.d  Mary  Beard's  The  K:sc  of 
Artcric.iK  C/:  ,Tho  TenJancy  of  A r.wncani  to  unite  with  their  fclhws 

for  varied  purposes  . . , now  became  a general  mania  ...  It  was  a rare  American 
who  was  not  a iiieinbei-  of  four  or  fvc  soclcow  . . . Ar.y  ciewea  who  refuted  to 
: affiliate  with  one  or  more  assridatu-ns  became  an  object  of  curiosity,  if  rot  sus- 
picion*. Although  in  compander  perspective  the  United  States  nay  well  be  a 
‘nation  of  joiners’,  a seivey  of  the  available  Ju.a  on  the  number  or  people  with 
memberships  in  voluntary  association;  tweeds  the  little-known  face  that  many 
have  not  c\ e:i  a single  such  affiliation.  This  uniformity  too  holds  for  all  type*  of 
areas  in  the  United  Smu>,  whether  urban,  suburban,  small  city,  small  town  or 
rural. <M 
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culture  within  a civilized  society.  1hc>c  methods  seem  to  have  succeeded  in 
presenting  a good  first  approximation  at  least  in  the  description  o[  the  culture 
and  behavior  of  such  groups.  When  anihrnpo! >e,h-:?  now  call  ior  forma! 
sampling,  data- recording,  and  stariojcal  analysis  it  is  either  to  catch  up  finer 
details  - the  small  number  of  deviant  individuals,  for  instance  - or  to  deal  with 
situations  of  culture  groups  with  less  homogeneity  — with  groups  in  process  of 
acculturation,  breakdown  of  old  norms,  or  the  develop  nunc  of  strong  internal 
differentiation. 

Iii  situations  of  less  homogeneity  and  simplicity,  it  is  doubtful  that  quad- 
statistics  are  anything  likrt  a full  substitute  for  actual  statistics.  However  they 
can  still  play  an  important  “exploratory”  function.  Statistical  research  is  too 
expensive  and  time-consuming  to  be  applied  on  nil  fronts  at  once;  like  the 
2DC-itich  telescope  It  must  focus  on  a few  areas  of  particular  interest  for  inten- 
sive study.  Quasi-statist’cal  studies  can  run  ahead  of  the  more  cumbersome 
quantitative  procedures  to  cover  wide  areas  of  social  phenomena,  and  to  probe 
into  tangled  complexes  of  relationship  in  search  of  possible  “processes’*.  'lhe\ 
serve  as  a broad  scanner  and  “finder”  like  the  wideargled  but  less  potveiful 
Schmidt  telescope  of  Mount  Wilson  and  Palomar.  Moreover  the  gathering  and 
analysis  of  "quad-statistical  data”  can  probably  be  made  more  systematic  than 
it  has  been  in  the  past,  if  the  logical  structure  of  quautttacive  research  at  least 
is  kept  in  mind  to  give  general  warnings  and  directions  to  the  qualitative  ob- 
server. 


Sy $ t or,  xt ic  C o ;v/\.  ri son 

There  is  or  e special  form  of  research  into. relationship  vein  :h  stands  on.  the 
border  between  statistical  and  quasi- 'Statistical  methods.  Tins  involves  the 
systematic  comparison  of  .a  relatively  small  number  of  cases.  Ic  differs  from 
quasi-statistics  in  that  the  cases  arc  proceeds  along  lines  closely  approxim  vim; 
those  of  a statistical  survey  or  controlled  experiment.  However  it  involves  tro 
few  ca^s  to  actual!/  apply  statistic. d to*:s,  and  it  involves  natural  situations 
in  which  one  c v.r.ot  be  certain  that  "other  factors  are  equal”  for  the  various 
case,  bcvo  .d  :h a;e  factors  specifically  analysed,  li  Is  as  though  one  set  up  the 
tables  for  a statistical  or  expe; hncarn!  rosurch,  but  had  only  one  or  two  cases 
to  hi!  in  cadi  cell,  and  p:  haps  had  to  leave  some  entirely  empty. 

1 iu>  i arm  o;  cooperative  re  eaivn  is  to  r.r  on  pj>  uue  wuen  tne 
to  be  studied  are  v>:!  il  phenomena  of  a Ivgh  order  of  c«  :.  plexlry,  such  as  war-, 
rceulv.aou:,  1.  ryc-seek  -octal  sy^um*,  forrro  of  o'  crn-tuen. V ere  Jo  no:  ex!,: 

preutivm  ;n.  i:;s  c.t-sic  c . mparau*. „ a:.*.;  : : s o:  i :r:  : C ivm-  v.  --  were 
we  v out  v ehoi:  t.cr.tv^ne  stub  civil:  a:i:>:  >,  ah*:..;  vkh  a trend  vr  of  abortive 
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or  arrested  civili ^ Ation a1*  YAber  faced  the  same  situation  in  Jvadr.g  v*ith  the 
role  of  reHTons  sy  sicnv.  in  the  development  of  society.  l>es’dc>  toe  total  number 
of  available  cases  being  small,  each  is  a very  large  and  complex  imu  winch 
requires  great  time  and  effort  to  analyze.  liven  where  there  are  a large  number 
of  cases,  this  factor  may  compel  the  researcher  to  testae:  himself  to  the  system- 
atic comparison  of  a few.  I his  situation  anscs  in  studying  communities  or  large 
institution;.  To  describe  any  one  community's  social  structure  a sue!)  a large  jo  a 
that  most  studies  have  been  of  single  case;.  Only  after  different  researchers  over 
a generation  have  pioduced  a dozen  or  so  such  studies  can  a “secondary  an- 
alyse*’ undertake  a comparative  analysis.  In  the  long  run,  it  is  to  be  hoped  that 
data-gathering  procedures  Oil  such  complex  “cases’*  can  be  so  stinphhcd  th.it 
statistical  studies  will  become  possible.  Then  the  intensive  study  ol  one  com- 
munity, factory,  union,  government  agency,  or  voluntary  association  can  give 
way  to  a quantitative  study  of  a sample  of  such  cases,  testing  the  hypotheses 
derived  from  single-case  studies.  Of  course,  where  the  d ff  i tout  y lies  in  the  fact 
that  there  is  only  a handful  oi  case  i all,  the  comparative  method  ist  tuc  best 
we  can  do. 

An  example  of  systematic  comparison  of  a small  number  ot  cases  is  oiiered 
by  Lip  set’s  >tudv  of  the  Canadian  province  of  Saskatchewan.  The  population 
of  the  province,'  mat  iffy  wheat  In  mere,  had  a remarkably  high  level  of  parti- 
cipation in  political  afftirs  and  in  collection  lor  this  unusu  A behavior,  com- 
parisons were  first  made  with  areas  wnerc  peiticipao.ou  v.vs  hno*\n  to  ' e he  . 

1 he  amount  of  participation  in  public  affairs  in  the  large  cities  of  Canada 
and  the  L . S.  is  notorious! v low.  Tilts  i * true  even  of  citm*  Imo  loionto  and 
Vancouver,  which  resemble  S.vkatchewan  in  giving  a large  vote  to  the  new. 
radical  C,  C.  F.  party.  Compating  Scokatchcwan  with  these  large  citns,  I.ip?L\ 
was  struck  In  the  smallness  oi  political  turns  in  Sc.'1.; a snev,  mi  and  the  hug* 
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tend  to  be  ra.orop Mi/ed  by  upper-class  pctrple  ai:d  professionals;  in  Saskatche- 
wan school  boards,  telephone  companies,  marketing  agencies,  etc,  had  to  h: 
stalled  by  ordinary  farmers,  who  thereby  acquired  organizational  and  political 
skills  unknown  to  the  average  city  dweller. 

Certain  rural  are  is  also  are  highly  siivuiiiej,  and  in  men  area,  the  rate  of  mass 
participation  is  abo  low: 

“Within  the  rural  areas  of  the  Southern  States  or  in  part,  of  California,  where 
si^nihenat  social  raid  economic  clewag*  oasts  with  in  the  ru  d coinnv.vuty,  the 
weilthicr  and  uppmcliss  farmer,  are  the  ♦oritia!  c«*:air, unity  leaders,  and  the  be:k 
of  the  poorer  farmer,  arc  politically  .ipaadwdc.”  (p.  2C.?J 

A third  structural  factor  distinguishing  Sadtatcheveau  is  its  e>:p  vuiv  to 
extreme  economic  fluctuations  due  to  the  unstable  price  ol  its  one  main  crop 
ami  to  the  recuirence  of  drought.  In  this  it  can  be  contrasted  vi'h  its  eastern 
neighbor,  Manitoba,  which  has  more  diverrihed  crops,  stable  markets  and  more 
reliable  vvathet.  And  ir  is  notable  that  Manitoba  today  has  much  less  com- 
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active  ill i.eighbo; in.;  urban  areas;  its  wide  "spread  politic a»  paitictp.uiou 
resembled  thru  of  the  SadmtJk  v.\m  fr.rnv.Ms  (pp.  30 3-3T.4).  The  generali'/ Alien  1 
of  explanatory  factor?  from  “the  wheat  economy”  to  attributes  applicable  to 
any  community  obviously  opens  up  a much  wider  range  of  cases  ior  use 
in  comparative  analysts. 

This  comparative  analysis  of  areas  of  high  and  low  participation  can  be 
summarised  in  the  following  scheme: 


Scheme  of  Factors  Accounting  tor  Political  Participation 
(Attril'itti's  in  / tnntrcict  'were  n>jt  ex;  licit!  y ducttssci  m j'v  ew.p.irisoas} 
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lr»  the  study  just  discussed  it  appears  that  comparisons  were  used  not  only 
to  suggest  explanatory  factors  but  aho  to  oficr  supporting  evidence,  as  a hind 
of  cjuaM-exjHM imcntal  test.  One  of  the  most  celebrated  b staras  of  suJ:  quad- 
experimental  tc>t?  is  found  in  Malinowski's  study  oi  the  u > e of  magic  in  th c Jro- 
brimd  Islands1*  Malinowski  wanted  to  test  the  old  theory  that  primitive  man 
uses  marie  because  of  a childlike  confusion  of  the  real  and  the  imaginary  or 
because  of  same  ir-.tinctlve  belief  in  the  supernatural.  He  found  the  Iro- 
brianders  engaging  in  some  activities  - for  instance,  fishing  w::h;n  the  lagoons 
- for  wbV:i  their  tedm o! j:;v  was  adequate  to  permit  certain  emnor.uc  iciurnf 
and  i' arson al  safety.  Other  activities  - for  instance  f'hmg  in  the  opm  sea  - 
involved  u;i  vaainty  cu  return  and  rid\S  to  life  wind:  could  r.o:  be  chm.i: atrd 
by  available  uJ.rmMgical  ivciu;.  In  the  s-fe  and  certain  acti  rid  ;.,  rm  magic 
was  used;  :r,  the  unsafe  a nt  vuev;  zmu  cy  •,  :vigv:  \ a>  used  a gre.*.?  O.  !.  1;  n 
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techniques,  but  a supplement  to  them  vluv  dealing  with  situations  beyond  the 
power  of  available  rational  tt * Ji tiolo^y t which  created  severe  emotional  strain, 

Of  course  the  u-e  of  companions  of  mu  ill  numbers  of  cases  as  tens  inquires 
great  caution;  care  must  be  taken  to  see  that  other  significant  factors  are  in  fact 
equal,  p.pd  that  cases  are  selected  in  an  unbiased  manner. 

IV,  MATRIX  FORMULATIONS 

Sometimes  the  analysis  of  qualitative  observations  con f toms  a mass  of 
particular  facts  of  such  great  number  end  variety  that  it  seems  quite  unwork- 
able to  treat  them  individually  as  descriptive  attributes  or  in  terms  of  their 
specific  interrelationships.  In  such  a situation  the  analyst  v ill  often  come  up 
with  a descriptive  concept  on  a higher  level  which  manages  to  embrace  and  sum 
up  a great  wealth  of  particular  observations  in  a single  formula.  Take  for 
instance,  Ruth  Benedict's  description  of  the  Zuni  Indians,  which  mentions  their 
avoidance  of  drugs  and  alcohol,  tln-ir  lack  of  personal  visions,  their  placid 
response  to  divorce,  their  “mild  and  ceremonious'*  relation  to  their  gods  and 
so  on.  After  presenting  a great  many  such  particular  facts,  Benedict  is  aide  to 
sum  them  up  in  a single  formula:  rJ  he  Tuui  culture  has  an  Apollonian  pattern  - 
that  is  a antral  theme  of  avoidance  of  emotional  excess.  This  pattern  or  theme 
permeate;  every  aspect  of  Zimi  life1.  Such  a formula  capable  of  summing  up 
in  a single  devrhrdve  concept  a greet  wealth  of  t articular  observations  may 
be  called  a matrix  formulation.  This  definition  covers  the  notion  of  a “Basic 
pattern**  oi  a culture,  a “theme**,  an  “er!i:T*,  a ‘Vcugoht**  or  “inantahty  ot  the 
time”,  a “national  character’*,  and  on  the  level  of  the  individual  person  a 
“pe  r > o n a I i t y type”. 

Matrix  formulations  may  be  applied  to  complex  units  at  any  level,  In  a 
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o„  of  ,11  ti,;«  Am  Mr  •>*  f‘"!1 

X .••*» 

?"« ft  ffff fro,, 

[he  people  had  nothing  tu  ,a>,  ttu>  tu  - 

kind  ofV'”cr-al  pm'yt"  ci  mc,ual  Ci,;r^u's'-  , , _ . ;>MUtcj  AS  “in- 

I;,  a study  of  a particular  group  ~ people  v.-lto  h-d  b.^n £**  ;*  “ ..... 

fluentials”  in  an  American  community- of  no  avail  in 
plaining  their  diverse  behaviors.  a«« ^ ‘ ' , . , pnrl;a,hr  be- 

accounting  for  the  wide  range  of  ob,cn  atio,  , 1 J inv,wi„cs  - 

havior  on  v.l,iA  the  research  seas  (ccomd  ‘gf  discrete  imorcssiot.s 

remained  unexplained.  In  trymg  too  ««  be  - W‘f  one 

not  closely  related  one  to  tnc  oihc.  • . u.  ^nic  wcre  "cos.no- 

politan  , pnni'idy  mtetou*  »n  t u ••  comnnmity  itself.  Merton 

others  were  “local”,  pr.nnr  y .,Un,,au,.  tm.  c • f i U;r,tl  wl„,n  he 

clearly  indicates  the  typical  function  of  such  a n.ntv.,  io...u.,a 

other  differences  between  the  local  md 

of  olwdit^rnf.es:  (1)  in  thv  sutures  of  social 
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lacion  uf  an  Apollonhu 

i culture  war  a ell 

.iUV.c  in 

terms  of  t 

he  variablv  “emo- 

tinned  tone”  - their  t mo 

dona!  tom;  was  !r 

>•'.  in  ir.u* 

iwity.  Thus 

. all  v.  cm  ti'.ythei 

in  the  fame  sen  so  that 

one  can  classify 

in  .1  sin.’ 

;!e  gamp  , 

ait  n-gions  v.  it : i a 

v e r y eve  it  tern  p e i a ture, 

vi  alt  people  wi 

tn  .i  liiji 

blood  pro 

is  uv.  In  ,t  matrix 

fornnil-ii’Oii  stub  as  ‘lonnicd  “Genvir.arhnfl”,  the  dements  warn  to  be  involved 
in  causal  rotations  anal  precedes  with  oik*  a.notherb  "1  Iv  element  ol  “reciprocal 
taut”  for  example  can  he  condde.vd  as  growing  out  of  the  element  of  “pro- 
longed face-to-face  association  with  the  ;me  people”,  as  can  a great  many  of 
thcotlici  ciouacteristicsof  <t  “Gomel  ns-di .^“-situation.  In  the  matrix  formulation 
of  an  “anal  personal i ty * \ the  behavior  characteristics  arc  thought  of  as  all 
arising  ont  of  a single  ha>k  factor,  the  trxat'.ua  of  the  cuvbc  development  at  a 
certain  chiIJidi  stage.  In  man)-  of  the  enhua -pattern  formulations  since  Bene- 
dict's purely  descriptive  one",  v*e  finrl  the  idei  tint  all  of  the  elements  in  the 
pattern  are  products  of  the  pattern  of  child  training  •-  or  even  of  a single 
element  in  the  child  training  pattern.  Some  matrix  formulations  invoice  a 
mixuneof  descriptively  related  and  causally  related  elements. 

Yer  another  way  in  which  matiix  formulae  can  he  differential* J could  be 
called  their  “project: ve  Ji>:  mcc*\  The  following  example*  should  indicate 
vrh at  Is  meant  by  this  dimension.  In  Mer  lords  stud)  o\  M.iss  /hvxm:  it  was 

found  that  a vide  var! et/  of  nmrrV;  made  by  the  Kate  Sir. * t !>  devomo s could 
be  summed  up  in  the  notion  of  “submiss*  \ e:.e  ■;  to  the  stains  epm/'x  *1  It  ay  !.v;lieved 
the. t it  was  right  for  come  to  be  poor  and  ethers  rid.,  they  accepted  their  p-. 
don  in  tin*  system  of  sti.vdtJ  cation,  they  tetioiultacd  that  the  rich  lud  so  many 
troubles  that  it  vv.s  j t as  well  to  be  poor.  The  more  go. Krai  descriptive 
concept  follow;  very  directly  from  Cue  mankes:  content  of  tlte  respondent’s 
statements;  they  could  almost  have  made  the  general/ vdon  thev.wlvcs. 
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P.mons  gives  ;.s  e::.v*:gh>  of  secondary  descriptive  schemes  typologies  o! 
social  ivlc.ti oiadu;v,  personality  types  And  descriptive  categories  applied  to 
groups.  Vhile  IV.r;  >nd  concept  is  not  entire!)'  identical  vekh  that  of  the  matrix 
formulation,  ii  ilk = -t rates  rnv.wk  of  the  reasoning  Behind  svdn  complex  vie 
scripiivc ‘concepts  v hick  sum  vp  e.r.d  render  manageable  a large  a i d varied 
body  of  individual  point*  of  data. 

V.  QlIAl.n  \’HVi  SUPPORT  OF  THI.UKY 
So  f..r  vo  haw  mainly  discussed  v. ays  in  v.*hhk  cjoahtative  data  can  con* 
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drew  c:\  from  literature  ns  well  as  ro i ! cas e-m>;o;:es  . F»ul  lor  t.:e  use 

v.e  have  in  mind  it  is  important  that  the  examples  are  real.  Since  they  are  :k*; 
systematically  samples  o:  p;c*  i<;I\  measured,  they  do  not  oiTer  rigorous  proof 
in  any  sntht.ecd  or  experimental  s:n?r.  Ve;  ncconhm;  to  their  number  range, 
and  iclatior  t j the  reader's  oven  experience  they  offer  vatxing  degree,  of 
suppa;t  or  eoi rohoratian.  It  is  thi>  function  which  vec  vi,h  to  examine. 

Psyd.oanalysb.  Marxism*  and  oilier  theories  of  binary  in  general  or  of 
overall  personality  development  arc  1 if  go  and  complicated  structure*.  The  u:e 
of  qualitative  de.ti  in  supporting  such  cop:;*!-.  :c  theoretical  systems  is  one  of 
the  major  undeveloped  a.iea>  for  r.iediodoiogic  *1  analysis.  In  order  to  find 
r«;  itngcablc  ilIu*:iatior*  of  the  use  of  qualitative  data  t » support  theory,  we 
will  re-trivC  ours  I s c,  to  a n:t:<h  simpler  type,  vhiui  may  be  called  "trend 
theories".  These  arc  theories  which  call  attention  to  cne  particular  vc.d  in 
society,  Vitally  derived  from  *omc  underlying  change  in  the  cconcnvc  or  demo- 
graphic structure. 

\cc  \ ill  comide.  three  suvh  theories.  Inch  Fromm  has  s:g;e J that  a major 
nndency  on  ov»r  time  is  "self-alien 'tlo:i’\  rcmltir.g  ftom  ir.e  i*:<;curi\Ci  and 
disruption  of  social  bonds  brought  about  he  the  urn  of  the  rnadsei  economy  and 
i;  du^trialization.  C.  Vrlght  Mills  sugge-t*  ;h  the  rb,  rj  tr.**  big  chy  ai  d the 
rtendardizat* :n  t >te?  subjects  in  uvwh  numbers  o?  people  to  stmuoiw 
cut  it'. . i as>veii  propose*  ii*e  "dev el  jpmsual  ce  n tiuei'  of  ti  e garfir  m state, 
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INTRODUCING  ANOTHER  VARIABLE 


One  of  the  most  important  tasks  in  the  analysis  of  data  is  to 
discover  what  we  can  about  cause  and  effect  relationships.  This 
does  not  mean  that  we  can  conclusively  prove  our  assertions, 
since  they  often  turn  on  assumptions  which  have  not  yet  been  test  1 
or  which  are  untestable  for  all  practical  purposes.  But  we  still 
must  be  as  explicit  as  we  can,  showing  the  logical  structure  of 
our.  arguments  and  the  points  at  which  empirical  data  are  rele- 
vant. It  is  the  nature  of  this  task  which  Is  discussed  by  Pro- 
fessors Goode  and  Hatt,  using  terms  such  as  intervening  vari- 
able, antecedent  variable,  specification,  and  so  on,  which  are 
part  of  the  working  vocabulary  of  the  social  sciences. 


From  William  J.  Goode  and  Paul  K.  Hatt,  Methods 
in  Social  Research,  Copyright  1963  by  McGraw- 
Hill  Book  Company.  Used  by  permission  of  McGraw- 
Hill  Book  Company,  pp.  313-358. 
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William  J.  Goode  and  Paul  K.  ilafct,  Mct.l  ods  in  Social 
Research  (Rev;  York:  KcGrav-HJll,  1952),  pp  313  *358. 


A main  thread  of  thought  tunning  through  tins  volume  is  that  sociology 
r ests  upon  the  same  foundations  as  do  all  other  sciences.  Its  assumptions 
regarding  the  empirical  world  arc  the  same,  and  the  piccauiions  which  the 
sociologist  must  observe  in  cairying  out  research  must  also  be  observed  by 
scientists  in  other  fields.  We  me,  then,  at  tire  beginning  of  an  era  in  which 
the  conscious  application  of  scientific  methods  to  sociological  problems 
should  be  marked  by  great  advances. 

This  premise  is  also  based  14*011  a fundamental  asmmplion  of  sociology, 
constantly  borne  out  by  daily  observation,  that  there  i*  an  orderliness 
within  social  phenomena.  Ju«t  as  the  psychiatrist  finds  a system  or  struc- 
ture within  the  seemingly  most  ir coherent  expressions  of  a schirophrenc, 
so  can  the  sociologist  find  ptcdic lability  and  order  within  the  most  anomic 
or  disorganiied  strata  and  social  groups*  Ifi*  Intimate  spontaneity  of 
friendship  groups  exhibits  this  order  lines*  no  k;$  than  the  iinpmou.il 
memoranda  within  a bureaucracy.  Indeed,  if  the  behavior  of  other  mem- 
bers of  society* were  not  predictable,  it  would  not  be  |>os? ib! c to  communi- 
cate, interact,  or  maintain  any  of  the  common  enterprises  in  which  we 
take  part. 

It  follows,  then,  (hat  modern  research  must  reject  as  a false  dichotomy 
the  separation  between  "qualitative”  and  ’ quantitative”  studies,  or  be- 
tween the  "statistical”  and  the  "nomtatistical”  approach.  The  application 
of  mathematics  to  sociology  docs  not  ensure  rigor  of  proof,  any  mote  than 
the  use  of  "insight"  guarantees  the  significance  of  the  tftesren. 

The  fundamental  questions  to  f*«k  about  ah  research  techniques  are 
those  dealing  with  the  precision,  icliabilitv.  and  TclcvanrC  of  the  del?  and 
their  analysis:  (1)  how  precise  arc  the  observations?  (2)  can  other  scientists 
repeat  the  observations?  and  (S)  do  the  dart  actually  satisfy  the  demands 
of  the  ptobkin,  that  is,  do  they  jw.maUy  demonsuate  the  conclusion?  If 
the  observations  arc  crude,  casting  them  in  a ttatmital  form  will  not  help 
the  research.  If  other  scientists  cannot  repeat  them,  mathematical  manipu- 
lation is  futile.  If  the  data  do  not  satisfy  a rigorous  logic  of  proof,  the 
conclusion  remains  doubtful. 

Furthermore,  no  matter  how  precite  measurement  may  te,  f X uhich  is 
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measured  remains  a quality*  Quantification  simply  achieves  greater 
precision  and  reliability  in  measuring  f/.c  qua’ifu'i  u7irV/i  arc  considered 
important— intensity  of  ami  Semitic  attitudes,  degree  of  social  cohesion, 
conformity  with  moral  rules,  etc.  Tire  process  of  achieving  precision  leads 
to  the  clarification  of  ideas  and  helps  to  recast  substantive  knowledge,  blit 
in  a fundamental  seme  the  research  may  nevertheless  be  called  qualitative. 

Similarly,  the  most  "qualitative"  of  social  research  attempts  rough  meat * 
urement.  The  historian  speaks  of  “a  growing  antimonarchical  feeling 
during  the  reign  of  wouh  XVI."  The  anthropologist  contrasts  the  intensity 
of  emotion  aroused  by,  say,  the  murder  of  a kinsman  as  against  the  minder 
of  a tribal  enemy.  The  economist  may  single  out  those  who  find  the 
security  of  income  fir  more  important  titan  the  promise  of  great  rewards 
when  they  are  considering  the  choice  of  an  occupation.  The  sociologist 
comments  that  tire  individual  who  is  well  integrated  within  a strongly 
cohesive  group  is  better  able  to  withstand  emotional  shocks  than  other 
individuals.  In  each  of  these  statements,  there  arc  implied  measurements 
of  Important  qualities.  We  may  or  may  not  be  satisfied  with  the  degree 
of  precision  in  these  notions,  but  they  are  essentia!!)  attempts  to  measure 
the  effect  of  different  variables.  It  is  equally  clear  that,  when  data  are  suf- 
ficiently precise,  statistical  techniques  can  simplify  the  task  of  understand- 
ing them.  Such  techniques  a?c  aids  in  research,  just  as  good  methods  of 
recording  data  can  be,  and  they  should  be  used  whenever  the  problem 
permits  it. 

Thus,  the  increasing  use  of  statistics  is  not  the  distinguishing  feature  of 
modern  social  research.  Rather,  it  is  the  increasing  precision  and  reliabil- 
ity of  research  techniques,  and  higher  standards  of  proof,  which  have 
made  the  use  of  statistics  more  fi  ttitful.  In  turn,  the  increasing  /urd/irbim 
of  statistical  manipulation  has  stimulated  further  developments  in  both 
sampling  and  statistical  theory;  while  the  needs  of  modern  statistics  have 
stimulated  still  greater  piccisicn  and  reliability  in  the  collection  of  data. 

These  developments  have  occulted  on  many  fronts,  such  as  the  better 
identification  of  the  important  social  variables,  increased  precision  ot  the 
questions  used  in  schedules,  and  a belter  grasp  of  interviewing  techniques, 
among  others. 

However,  most  of  this  growth  may  be  classified  as  techniques  for  stimu- 
lating responses,  or  obtaining  observations,  tcAicA  aif  easier  to  caUgotize . 
More  precise  questions  allow  the  answers  to  b?  analyzed  more  easily.  The 
mastciy  of  interviewing  permits  deeper  probing  in  the  search  for  more 
precise  answers  from  respondents.  What,  then,  of  those  sources  of  dab 
which  have  not  been  structured  previously  by  such  techniques?  Most  of 
our  daily  observation  and  experience,  the  newspapers  and  magazines  we 
read,  the  radio  programs  to  which  we  listen,  as  well  as  histoiical  records 
and  the  recorded  protocols  of  psychiatric  and  other  depth  interviews,  aie 
all  essentially  "unstructurcH**  but  may  be  important  sources  of  data  for 
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certain  sociological  problems.  If  such  data  are  eliminated  from  consid- 
eration, the  range  of  information  available  is  narrowed  and  much  of  the 
richness  of  social  experience  may  be  lost.  On  the  other  hand,  if  they  are 
utilized  as  they  occur,  little  order  appears  in  them,  and  few  fellow  scien- 
tists might  agree  to  any  one  interpretation  of  them. 

Techniques  are  therefore  being  developed  which  permit  ns  to  order 
and  analyze  such  data.  Since  most  of  these  procedures  are  really  ways  of 
classifying  data  which  were  not  originally  created  for  research  purposes, 
the  term  qualitative  coding  is  usually  applied  to  them.  When  qualitaii%*e 
coding  is  applied  to  the  content  of  various  communication  media  such  as 
magazines,  newspapers,  radio  programs,  or  similar  materials,  it  is  called 
content  analysis.  Since  the  most  effective  application  of  the  case  history  or 
case  study  to  social  research  depends  in  large  part  upon  qualitative  cod- 
ing, that  special  problem  i,  included  in  the  following  discussion. 

Succeeding  sections  will,  then,  deal  with  these  subjects:  (1)  simple  coding 
operations,  (2)  qualitative  coding,  (3)  content  analysis  as  one  application 
of  qualitative  coding,  and  (4)  the  case  study. 

SIMPLE  CODING  OPERATIONS 

The  student  may  feel  that  there  is  little  point  to  a discussion  of  coding 
operations,  since  they  are  usually  bracketed  with  large  Kale  sutveys. 
Although  it  is  true  that  such  surveys  almost  always  code  the  materials 
gathered,  the  student  may  find  it  profitable  to  consider  whether  a small 
project  might  also  benefit  from  coding  the  data.  If  the  class  attempts  a 
joint  projut,  or  breaks  into  seveial  project  groups,  coding  may  be  the 
most  cffrclise  means  of  handling  the  data.  A brief  explanation  of  this 
tool  will  allow  the  student  to  make  a decision  concerning  the  use  ol  coding. 

HTicn  to  code . Coding  is  an  operation  by  which  data  are  oiganizcd  into 
classes,  and  a number  or  symbol  is  given  to  each  item,  according  to  the 
class  in  which  it  falls.  Thus,  counting  the  symbols  gives  us  the  total 
number  of  items  in  any  given  class.  The  basic  operation,  of  course,  is 
that  of  classification.  Assigning  the  number  or  symbol  to  a given  datum 
then  becomes  a mechanical  procedure.  How  to  classify  must,  naturally, 
depend  upon  the  questions  which  have  been  asked  and  the  concepts  which 
arc  used  in  the  particular  research.  These  problems  are  discussed  in  the 
earlier  chapters  of  this  volume,  and  examples  are  given  in  this  chapter, 
l et  us  now  ask  the  practical  question,  "When  is  it  profitable  to  code?" 

The  answer  depends  mainly  on  three  variables:  (1)  the  number  of 
respondents  or  sources  of  data  in  our  study:  (2)  the  number  of  questions 
asked;  and  (3)  the  number  and  complexity  of  statistical  operations  planned 
for  the  study.  If  the  number  of  cases  is  large,  any  kind  of  tabulation  be* 
tomes  difficult  unless  the  data  are  coded.  1 he  individual  sheets  for  each 
case  become  separated  after  a few  shufflings,  or  they  become  torn  and  worn. 
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Small  errors  in  counting  the  answers  require  handling  all  the  sheets  once 
more.  By  the  use  of  coding  procedures,  however,  retabulations  may  be 
avoided  or  minimized.  If  the  number  of  questions  is  large,  the  same  set  of 
considerations  holds. 

Most  important,  however,  there  is  no  easy  way  to  carry  out  complex 
cross  tabulations  without  some  form  of  coding.  Any  statistical  operation 
requires  the  manipulation  of  numbers,  which  in  turn  must  represent  the 
data  fiom  the  schedule.  It  is  possible  either  to  make  many  piles  of  sheets 
tor  each  comparison  or  analysts,  or  to  give  numbers  to  the  answers  and 
summarize  these  separately  on  other  sheets  or  cards.  In  the  latter  case,  the 
operations  are  much  simpler.  The  more  complex  the  operations  planned, 
the  more  useful  is  some  form  of  coding. 

At  what  stage  to  code . Coding  can  be  carried  out  at  any  phase  in  the 
study,  from  the  interview  itself  to  the  period  just  prior  to  the  tabulations. 
In  the  section  on  formulating  the  questionnaire,  mention  was  made  of 
precoded  questions.  These  were  questions  which  had  already  been  field- 
tested  for  meaning  and  range  of  possible  answers,  and  which  were  physi- 
cally set  up  on  the  schedule  so  that  checking  the  answer  automatically 
coded  the  data.  An  example  would  be  the  following: 

I 2 

Are  you  a sctcran  of  World  War  11?  (Circle  answer)  — . — 

Answers  which  are  set  up  in  this  fashion  can  be  tabulated  very  easily  by 
hand,  or  they  may  be  punched  directly  onto  cards  for  machine  tabulation. 
In  this  case  the  interviewer  is  actually  coding  as  he  goes  along,  although 
no  separate  operation  is  required. 

Similarly,  the  interviewer  may  be  asked  to  do  the  coding  as  soon  as  he 
hears  the  answer.  This  can  be  done  in  a fairly  cautious  fashion,  as  an 
almost  automatic  operation: 

How  did  your  husband  feel  about  this?  1.  Strongly  approved;  2.  Mildly 
approved;  3.  Indifferent;  4.  Mildly  disapproved:  3.  Strongty  disapproved.  2 

When  the  respondent  answers,  the  interviewer  has  only  to  mark  the  proper 
number  in  the  coding  margin. 

Suppose,  however,  that  the  goal  is  to  classify  respondents  in  terms  of 
annual  income,  while  many  of  them  are  laborers  who  typically  think  ol 
their  earnings  in  terms  of  daily  wages.  It  would  be  possible  to  ask  them 
for  their  total  annual  earnings  directly.  Th>s,  however,  might  cause  them 
to  make  important  errors  in  atithmetic.  Second,  the  interviewer  might 
carry  out  the  operation  mentally,  coding  the  total  e-  nings  in  its  proper 
class  after  calculating  daily  earnings  times  the  total  estimated  number  ol 
days  or  weeks  worked.  Although  this  is  a simple  procedure,  experience 
shows  that  eun  the  best  interviewers  may  make  errors.  Most  important, 
Iiowever,  thetc  is  no  way  of  discovering  the  error,  since  the  only  figute 
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recorded  will  be  the  code  number.  Consequently,  a third  procedure  is 
indicated:  (1)  recording  the  daily  wage;  (2)  recording  the  estimated  num- 
ber of  days  or  weeks  worked;  and  (3)  carrying  out  the  necessary  calcula* 
tions  and  coding  in  the  office,  fron  these  original  figures*  Although  errors 
can  be  made,  at  least  they  can  be  checked. 

The  decision  as  to  the  time  of  coding  must,  then,  be  a matter  of  choosing 
the  phase  at  which  the  least  number  of  errors,  and  the  greatest  amount  of 
control,  will  be  achieved. 

Paper  aids  to  coding.  When  only  simple  tabulations  are  required,  there 
is  little  advantage  to  transcribing  the  coding.  A heavy  colored  pencil  can 
be  used  to  mark  the  code  symbols  opposite  the  answers,  and  the  answers 
can  be  hand  tabulated.  If  the  number  of  cases  is  small,  some  type  of  sum- 
mary sheet  may  be  used.  This  may  be  attached  to  each  schedule,  for  later 
independent  use,  or  it  may  be  designed  on  a larger  scale  to  take  care  of  all 
the  cases.  If  a special  sheet  is  used  for  each  schedule,  it  should  be  very  heavy 
so  as  to  withstand  considerable  handling,  and  designed  for  ease  of  tabu- 
lation* Such  sheets  have  the  advantage  that  several  persons  can  work  on 
this  stage  of  the  operation  at  one  time. 

The  larger  summary  sheets , containing  data  from  all  the  case.,,  are  some- 
times useful  for  quick  overviews  of  the  data,  especially  when  the  number 
of  cases  is  small.  The  accompanying  illustration  shows  a simple  sheet  of 
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Age  Group 

Religion 

V/eeldy  Wage 

20  lo  29 

30fo  39 

40  to  49 

50  to  59 

60  \ over 

1 2 3 

4 

1 

2 

3 

4 

3 

6 

7 

8 

9 

1.  Jones,  Carpenter 

2.  Smith,  Plumber 

3.  Roe,  Plasterer 

etc. 

FIGURE  I.  Example  of  a summary  sheet  for  simplifying  analysis  of  the  study  data. 


this  kind.  The  code  symbols  can  be  used  instead  of  full  headings,  as  a solu- 
tion for  the  space  problem.  It  is  possible  to  design  such  sheets  from  ac- 
counting paper,  affording  an  easy  method  for  the  cross  tabulation  ot  a 
small  number  of  cases. 

However,  when  the  numbers  of  cases  go  much  beyond  150,  the  student 
may  find  some  form  of  punch  card  of  advantage.  Unless  IBM  machines 
are  easily  available,  the  use  ol’  Hollerith  cards  is  not  a good  solution  for 
students.  A great  amount  of  time  may  be  wasted  in  obtaining  access  to  the 
machines,  or  in  planning  carefully  the  exact  operations  to  be  carried  out, 
so  that  adequate  written  instructions  to  the  operators  can  be  given.  The 
net  result  may  be  a loss  in  efficiency. 
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On  the  other  hand,  even  for  student  group  projects  there  may  be  a net 
advantage  in  using  the  McBee  Keysort  punch  card,  For  both  this  card  and 
the  Hollerith  card,  it  is  necessary  to  use  numbers,  for  the  coding  tech- 
niques are  the  same.  The  McBee  card  is  relatively  (^expensive  and  is 
procurable  in  several  styles,  depending  on  the  number  of  items  being 
coded.  It  is  basically  a card  with  a row  of  holes  punched  around  its  edge, 
In  most  studies,  a single  card  will  cairy  the  data  for  one  respondent  or 
group.  However,  more  than  one  card  may  be  used,  if  an  identifying  punch 
or  v/ritten  symbol  is  used  for  both  cards  to  prevent  any  confusion. 

Each  hole  around  the  edge  of  the  card  is  numbered,  The  technique, 
then,  is  to  open  up  the  hole  corresponding  to  the  code  number.  Thus, 


FIGURE  2.  Sample  of  the  McBee  card,  showing  method  of  opening  holes. 


with  a simple  three-choice  code,  “Approve"  (1),  ‘‘Indifferent"  (2),  and 
"Disapprove"  (3),  and  a response  of  "Approve,"  which  is  No.  1 in  the  code, 
the  hole  corresponding  to  No.  1,  on  the  right-hand  upper  edge  of  the 
card,  is  opened  as  in  the  accompanying  illustration.  When  this  set  of 
answers  for  all  respondents  has  been  thus  coded  and  punched,  it  is  pos- 
sible to  separate  those  in  any  desired  category  by  inserting  a needle 
through  the  holes.  When  the  needle  is  lifted,  all  the  cases  corresponding 
to  that  punch  will  drop  out  (because  that  section  of  the  card  has  been 
opened).  The  remaining  cards,  with  intact  holes,  will  remain  on  the 
needle.  Having  coded  and  punched  all  the  answers  in  a similar  fashion, 
it  is  possible  to  select  all  the  males  who  are  over  twenty  years  of  age,  and 
who  are  married,  by  three  insertions  of  the  sorting  needle.  Since  this 
method  is  used  only  on  small  samples,  the  actual  counting  is  a minor 
task.  What  is  important  is  that  all  the  data  have  been  classified  and  re- 
corded permanently  on  the  cards,  for  any  kinds  of  complex  operations 
which  may  later  be  decided  upon. 

Since  the  number  of  questions  may  rapidly  exhaust  the  number  of  holes, 
it  is  possible  to  use  combinations  of  the  numbers  in  the  two  rows.  This 
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requires  double  punching  and  thus  a double  insertion,  but  more  data  can 
be  placed  on  the  single  card.  Thus,  for  the  first  section  in  the  upper  right- 
hand  corner,  we  could  obtain  eight  classes,  as  shown  in  the  table.  Further 


Coded  as  Hole  Number 
Top  Edge  of  Card 

Income  Level  , • 

f 10.00  lo  J 24.99  (punch)  1 

25.00  to  59.99  2 

40.00  to  54.99  (2  plus  1)  3 

55.00  to  09.99  4 

70.00  to  84.99  (4  plus  1)  5 

85.00  to  99.99  (4  plus  2)  6 

100.00  to  114.99  7 

115.00  to  129.99  (7  plus  1)  8 

instructions  may  be  obtained  from  tne  McBee  Company  for  still  other 
operations  made  possible  by  the  Keysort  system,1 

For  large  surveys,  however,  the  need  for  greater  ease  and  speed  of  sort- 
ing and  tabulation  requires  the  use  of  IBM  and  similar  equipment,  First 
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FIGURE  3.  Sample  of  a punched  Hollerith  card  used  in  IBM  counting,  sorting,  and 
tabulating  machinery. 


developed  for  the  needs  of  the  United  States  census,  such  mechanical  aids 
have  come  to  be  applied  to  many  complex  statistical  problems  in  both 
business  and  research.  The  punch  card  used,  which  is  called  a Hollerith 
card,  is  reproduced  here.  Each  one  of  its  80  columns  contains  10  numeri- 
cal choices,  so  that  for  any  given  item  (such  as  "wages”)  it  is  possible  to 
have  10  coding  classes.  There  are  two  additional  punches,  called  "fields,” 
which  can  be  used  for  further  classes.  However,  these  "fields”  are  more 
often  used  to  direct  the  various  machines  to  carry  out  statistical  operations. 
That  is,  fairly  complex  statistical  manipulations  can  be  wired  into  the 

1 McBee  Keysort  Company,  Detroit,  MicH, 
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machine  by  changing  the  circuits  on  a removable  wiring  board.  The  “field 
punches,”  then,  control  some  of  these  operations. 

The  advantages  of  such  an  aid  are  obvious:  greater  amount  of  informa- 
tion on  each  card,  greater  ease  and  speed  in  carrying  out  simple  sorting 
and  tabulation,  and  the  possibility  of  performing  complex  statistical  com- 
putations within  the  machines  themseives.  However,  for  most  student 
projects  and  for  mos*  projects  with  low  funds  and  small  numbers  of  cases, 
these  advantages  do  not  outweigh  the  low  cost  and  ease  of  independent 
operations  represented  by  the  Keysort  system. 

Checking  the  coding  operations . Even  when  the  student  himself  is 
carrying  out  the  coding  of  his  schedules,  careful  instructions  should  be 
written  and  should  be  followed  without  deviation.  Further,  such  direc- 
tions should  be  tested  by  having  fellow  students  code  a sample  of  the 
scftedules  according  to  the  instructions.  If  the  coding  carried  out  by  others 
deviates  from  his  own,  the  instructions  may  be  ambiguous,  or  they  may 
fail  to  take  into  account  some  of  the  problems  presented  by  the  data.  This 
precaution  may  uncover  some  difficulties  in  classification  which  the  re- 
searcher has  not  clarified  adequately. 

Since  it  is  possible  to  make  errors  no  matter  how  simple  the  problems 
of  classifying  and  coding,  a definite  sampling  of  the  coding  should  be 
made,  and  every  nth  schedule  should  be  recorded  to  see  whether  there  are 
many  errors.  This  is  called  "spot  checking,"  and  it  is  a further  precaution 
against  errors  in  the  mechanics  of  coding. 

QUALITATIVE  CODING 

One  of  the  more  important  forward  steps  in  recent  social  research  is  the 
investigation  of  qualitative  coding  as  an  explicit  problem.  The  term 
"qualitative  coding”  is  given  to  all  the  techniques  for  classifying  reliably 
those  social  data  on  which  very  little  order  has  been  previously  imposed 
by  the  researcher.  Such  a definition  highlights  the  fact  that  coding  is 
basically  a matter  of  classifying.  When  the  data  have  already  been  classi- 
fied with  respect  to  our  theoretical  needs,  as  was  assumed  in  the  previous 
section,  our  problems  ;rc  mainly  mechanical.  However,  when  the  data 
have  not  been  structured  in  neat  categories  by  carefully  designed  ques- 
tions, the  problems  are  more  complex. 

As  has  been  noted  several  times,  not  only  all  social  research  but  also  all 
social  interaction  requires  this  kind  of  classification.  We  are  constantly 
ordering  the  manifold  social  experiences  in  which  we  participate.  When 
we  investigate  a past  historical  epoch,  we  must  begin,  to  organize  data 
which  were  not  recorded  for  our  purposes.  If  we  attempt  to  study  the 
social  patterns  of  a small  community,  or  of  a work  unit,  or  of  a primitive 
society,  we  must  learn  how  to  classify  the  relevant  data. 
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Similarly,  when  schedules  are  tested,  intensive  intei views  which  are 
essentially  unstructured  are  often  used,  and  these  answers  are  classified 
before  a more  structured  set  of  questions  is  formulated.  By  this  kind  of 
approach,  it  becomes  possible  to  use  important  sources  of  data  beyond 
those  already  patterned  by  narrow  sets  of  questions.  Historical  records, 
case  records  of  various  clinks  or  agencies,  diaries  and  letters,  propaganda 
and  advertising,  or  nondirective  interviews  may  thus  become  grist  for  the 
researchers  milk 

Most  research  based  on  such  records  has,  however,  been  open  to  the 
criticism  that  the  reader  did  not  “see1'  the  order  created  by  the  researcher. 
The  conclusions  were  based  on  perhaps  adequate  data,  but  the  reader  did 
not  agree  that  the  data  had  been  interpreted  correctly.  Yet  each  researcher 
must  eventually  meet  the  criticisms  of  his  peers,  and  unless  he  sets  forth  a 
set  of  instructions  for  classifying  the  same  data,  no  one  is  bound  to  accept 
the  results.  “Intuition"  is  useful  in  the  search  for  creative  ideas,  but  the 
demonstration  of  their  truth  cannot  be  founded  upon  any  expectation  that 
the  intuition  of  others  will  come  to  the  same  conclusions.  It  is  therefore 
necessary  to  develop  an  explicit  set  of  instructions  for  handling  data,  thus 
forging  a link  between  apparently  unordered  observations  and  adequately 
demonstrated  generalizations. 

Research  experience  in  this  area  has  not,  however,  been  adequately 
organized  and  published  as  yet.  It  will  therefore  be  useful  to  outline  here 
some  of  the  main  steps  which  are  ordinarily  followed  in  qualitative 
coding,  along  with  some  of  the  problems  which  may  arise, 

1,  CLARIFY  WHAT  IT  IS  THAT  IS  DESIRED  FROM  THE  MATERIALS,  Let  US  SUp- 

pose  that  pilot  interviews  concerning  residential  mobility  have  used  the 
question,  "Would  you  mind  telling  me,  in  detail,  iust  how  you  came  to 
decide  on  moving  from  your  last  address?"  If  the  interviews  were  careful, 
there  should  be  several  remarks  from  each  respondent.  Presumably,  these 
answers  represent  many  different  types  of  decisions,  or  housing  situations, 
or  families,  or  even  modes  of  participation  in  the  community.  That  is, 
since  the  question  is  only  partly  structured,  the  answers  may  be  ordered 
iu  many  ways.  How  should  they  be  classified? 

The  answer  must  be  found  in  the  purposes  of  the  question.  If  the 
question  was  considered  ambiguous  by  the  respondents,  the  answers  may 
be  studied  only  to  see  whether  it  was  understood  accurately.  However,  if 
the  question  has  been  developed  beyond  this  stage,  from  among  the  many 
ways  of  classifying  the  answers,  the  one  which  best  suits  the  aims  of  the 
total  research  is  chosen.  If  we  are  mainly  interested  in  who  makes  such 
decisions,  we  will  note  which  persons  are  mentioned.  We  may  instead 
seek  cues  to  the  process  of  reaching  the  decision  and  look  for  statements 
about  its  phases.  If  we  are  seeking  motives , comments  which  attempt  to 
justify  the  move  will  be  sought.  If  we  are  seeking  predispositions , we  will 
attempt  to  classify  the  answers  according  to  the  descriptions  of  the  pre- 
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vious  housing  accommodations,  or  according  to  the  housing  values  ex- 
pressed. 

2.  STUDY  THE  COMPLETED  SCHEDULES  CAREFULLY.  This  IS  an  obviOUS 
second  step.  It  is  sometimes  a disillusioning  process,  since  the  answers  may 
seern  less  rewarding  on  the  schedules  than  they  seemed  while  the  interview 
was  under  way.  However,  it  is  these  records  on  which  the  demonstration 
of  the  facts  must  be  based.  If  there  was  moie  in  the  interview  situation 
than  appears  on  the  schedule,  it  is  wise  to  attempt  to  develop  better  ways 
of  recording  such  data  before  proceeding  with  the  research. 

3.  WORK  OUT  THE  CLASSES  AND  THE  INDICATORS  OF  THE  CLASSES.  After 
closely  studying  the  data,  it  is  likely  that  a rough  idea  of  the  classes  will 
be  developed.  In  some  cases,  of  course,  these  classes  have  been  in  mind 
from  the  beginning.  In  others,  the  possible  groupings  will  only  gradually 
take  shape. 

However,  it  is  at  this  point  that  a first  approximation  of  the  indicators 
to  be  used  in  coding  should  be  made.  Since  the  respondent  has  chosen  his 
own  words,  they  will  not  fall  into  neatly  prearranged  classes.  Slang  words 
may  appear.  Both  simple  and  complex  rationalizations  may  be  recorded. 
It  is  necessary  to  work  out  both  the  classes  and  the  cues,  comments,  or 
phrases  which  are  accepted  as  equivalent  indicators  for  those  classes.  For 
example,  housing  respondents  may  be  grouped  into  (1)  those  who  are 
anxious  to  move;  (2)  those  who  are  looking  for  a definite  type  of  housing; 
and  (3)  those  who  are  confronted  by  a housing  “opportunity."  This,  of 
course,  is  only  one  type  of  classification,  which  may  be  used  to  interpret 
other  comments  on  tne  process  of  decision.  However,  the  respondents  may 
not  have  used  such  phrases.  The  solution  exists,  then,  in  developing  cod- 
ing instructions  (for  the  researcher  himself  as  well  as  for  others)  which 
list  the  indicators  for  each  class.  For  example,  one  respondent  may  not  use 
the  term  “anxious'1  but  may  state,  “What  a clump  that  was!  But  we  had 
to  find  some  place  to  live,  so  we  pul  up  with  it  for  a year."  Or,  “My  cousin 
picked  it  out  while  we  were  in  Idaho.  We  were  certainly  sick  when  we  saw 
it,  but  what  could  we  do?"  Both  these  comments  indicate  the  respondents 
had  a strong  predisposition  to  move  and  vrerc  dissatisfied  with  their  hous- 
ing from  the  beginning. 

On  the  other  hand,  a clear  indicator  for  the  “opportunity"  group  might 
be  any  comment  which  suggested  general  satisfaction  with  the  housing, 
until  a “bargain"  was  discovered  accidentally.  This  type  of  comment 
would  be  the  clearest  case,  and  further  details  could  be  used  to  delimit 
the  class  in  the  case  of  respondents  who  were  not  very  satisfied  but  who 
state  that  they  would  not  have  moved  if  the  new  housing  had  not  been 
found  unexpectedly. 

The  goal,  therefore,  is  to  work  out  a set  of  instructions  which  will 
enable  others  to  classify  these  comments  into  the  types  which  are  rele- 
vant for  the  research.  Since,  ordinarily,  exactly  the  same  wording  cannot 
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be  expected,  the  practice  is  to  look  for  equivalent  meaning.  To  do  this, 
the  phrases  and  cues  which  will  unequivocally  indicate  the  proper  class 
are  listed  and  checked  by  seeing  whether  several  different  coders  using 
them  will  secure  the  same  results. 

In  some  cases,  the  indicators  are  known  from  the  beginning.  For 
example,  one  might  be  interested  in  discovering  the  extent  to  which 
nursery-age  children  use  various  ethnic  stereotypes  in  their  play  language. 
The  records  might  consist  of  observations  made  through  a one-way  visual 
screen,  supplemented  by  wire  recordings.  In  this  case  the  coding  problem 
would  be  simple,  for  the  phrases  used  in  ethnic  stereotypes  are  fairly  w?ell 
known. 

A somewhat  more  difficult  problem  occurs  when  a continuum  of  inten- 
sity is  the  object  of  the  coding  operation,  such  as  the  classification  of 
answers  from  those  expressing  a strong  degree  of  disapproval,  to  those 
expressing  strong  approval,  of  a particular  object.  As  noted  in  the  sections 
on  scaling,  this  problem  is  sometimes  met  by  the  use  of  judges,  who 
classify  independently  a series  of  responses  along  a continuum  of  ap- 
proval and  disapproval. 

Even  when  judges  are  employed  for  a preliminary  group  of  responses, 
the  problem  of  which  indicators  are  adequate  for  classification  must  still 
be  faced,  for  we  are  dealing  here  with  unstructured  data.  How  many  de- 
grees can  be  used  will  depend  cn  the  detail  to  be  found  in  the  responses. 
"Strong  disapproval'  or  "strong  approval11  may  be  fairly  easy  to  classify. 
The  less  intense  responses  will  sometimes  be  indicated  by  favorable  or 
unfavorable  comments  which  are  accompanied  by  various  qualifications: 
"They're  all  right,  but  some  of  them  . . or  "I  don't  have  much  to  do 
with  them,  but  I never  have  any  trouble.  . . .” 

4.  fit  the  classes  to  the  data.  Having  worked  out  a preliminary  set 
of  indicators,  the  gradual  process  of  fitting  classes  to  the  data  must  be 
begun.  A first  use  of  this  procedure  is  to  discover  whether  or  not  the 
instructions  are  clear.  By  working  out  the  preliminary  indicators  and 
applying  them,  it  is  often  possible  to  add  new  details  to  the  instructions, 
as  well  as  to  understand  more  clearly  the  nature  of  the  classes. 

At  this  stage  it  is  particularly  important  to  pay  attention  to  the  deviant 
cases , No  matter  how  carefully  the  first  set  of  instructions  have  been 
worked  out,  some  responses  will  occur  which  do  not  fit.  The  indicators 
may  not  a4>ply,  or  they  may  seem  fuziy  and  vague.  These  cases  should 
be  studied  in  detail,  for  they  may  require  a new  set  of  indicators,  or  a 
new  set  of  classes. 

Further,  it  is  likely  that  a small  group  of  "unknown"  or  “unclassifiable” 
respondents  will  remain  throughout  the  coding  process.  If  this  group  is 
large,  the  instructions,  classes,  or  data  are  poor.  On  the  other  hand,  if 
this  class  is  minute,  it  will  not  affect  the  conclusions  to  any  important 
degree. 
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5.  code  all  the  answers.  Having  gradually  fitted  the  instructions  to 
the  classes  needed,  an  attempt  to  code  all  the  answers  should  then  be 
made.  Here,  again,  it  is  necessary  to  test  the  coding  of  one  person  or  group 
against  that  of  another.  It  is  only  by  such  fitting  and  testing  that  it  is 
possible  to  check  whether  or  not  merely  impressionistic  judgments  have 
served  as  the  basis  of  interpretation. 

Special  problems.  Before  showing  how  qualitative  coding  is  applied  to 
communication  content,  such  as  newspapers,  radio  programs,  speeches, 
etc.,  some  common  problems  which  occur  often  will  be  discussed. 

A common  result  of  the  attempt  to  code  is  the  discovery  that  one  has 
failed  to  clarify  the  concept  itself  by  which  the  comments  are  to  be  or- 
dered.  For  example,  a question  about  “belief  in  democracy"  may  elicit 
cjmments  which  are  difficult  to  organize,  precisely  because  the  concept 
itself  has  not  been  properly  defined.  In  some  cases,  the  problem  can  be 
solved  by  a series  of  questions  or  by  wording  them  more  precisely.  In 
other  cases,  it  may  be  necessary  to  think  through  the  problem  f nee  more. 
For  example,  if  an  attempt  to  order  the  comments  in  terms  of  “shop 
morale”  confr  ses  job  satisfaction,  acceptance  of  group  standards  in  the 
shop,  belief  in  job  mobility,  or  o'her  possible  meanings  of  this  term, 
only  confusion  in  coding  can  result.  In  such  a case,  the  focus  of  the  re- 
search  must  be  more  clearly  defined. 

The  failure  to  elicit  the  data  desired  is  another  type  of  problem  which 
is  sometimes  met.  Usually,  this  will  be  caused  by  faulty  wording  of  the 
original  question.  Thus,  one  might  use  a direct  question,  such  as  “What 
made  you  commit  this  crime?”  when  the  aim  is  to  obtain  information  on 
the  process  of  deciding  to  undertake  a particular  criminal  act.  Many 
respondents,  however,  will  answer  in  terms  of  familial  backgrounds, 
slum  neighborhoods,  gang  behavior,  etc.  Then,  of  course,  it  is  necessary 
to  formulate  the  question  differently. 

Coding  may  result  in  an  equally  disheartening  discovery— that  most 
of  the  answers  prove  to  be  of  one  type.  Thus,  the  earlier  question  about 
residential  mobility,  "Why  aid  you  decide  to  buy  a house?"  may  elicit 
a set  of  answers  which  may  all  be  classifiable  as  cost  factors.  Other  ques- 
tions indicate  clearly,  however,  that  cost  cannot  be  the  main  factor. 
Respondents  do  not  know  much  about  comparative  costs,  and  most  of 
their  comments  suggest  that  familial  discussions  did  not  deal  with  costs 
to  any  great  extent.  We  then  see  that  "cost"  if?  a kind  of  rationalization 
for  other  motivations,  in  terms  which  are  acceptable  in  conversations  with 
friends  and  outsiders.  The  answers  to  the  formal  question  may  have  been 
so  nearly  identical  merely  because  the  basic  question  was  not  really  being 
answered. 

Such  “one-class"  tabulations  often  result  when  the  question  is  an- 
swered by  a clich£:  “I  decided  to  become  an  engineer  because  I’ve  always 
been  interested  in  it,”  ”We  were  married  because  we  were  in  love.”  “I 
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never  punish  my  children  because  they  must  be  allowed  to  express  their 
personalities  ” Reformulation  of  the  question,  or  the  use  of  p^obe  ques- 
tions,  may  be  required.  In  any  event,  the  coding  process  clearly  points 
to  a set  of  answers  which  seem  to  require  fuither  analysis  before  being 
accepted  as  satisfactory. 

On  the  other  hand,  the  attempt  to  find  order  in  esstntially  unstruc- 
tured data  may  lead  to  further  insights,  useful  in  understanding  the 
tabulations  already  made.  For  example,  in  one  unpublished  study  there 
were  questions  which  dealt  with  ethnic  opinions  ami  friendship  patterns. 
These  answers  were  adequately  coded  in  terms  of  attitudes  toward  various 
ethnic  groups  and  also  in  terms  of  friendship  patterns.  However,  thh 
process  suggested  that  the  comments  contained  further  data.  When  the 
respondents  spoke  of  their  friendships  with  members  of  orher  ethnic 
groups,  they  made  "individual  exemptions";  t\c\,  their  friends  were  dif- 
ferent. Their  Irish  friends  were  not  pugnacious  or  of  a low  culture,  or 
their  Jewish  friends  were  not  commercial-mil  ded  (or  radical).  Conse- 
quently, a further  coding  operation  was  carried  out,  in  terms  of  the  indi- 
vidual exemptions  used.  Later  cross  tabulations  demonetised  the  fruit- 
fulness of  this  operation,  since  a relationship  existed  between  the  degree 
of  ethnic  prejudice  and  the  types  of  individual  exemptions  made. 

Attempting  to  make  procedures  explicit  is  merely  following  one  re- 
quirement of  scientific  vork—that  operations  must  be  repeatable  by 
others.  If  interview  protocols  are  seen  as  'observations,"  it  is  clearly  neces- 
sary that  these  observations  actually  be  classifiable  by  others  as  they  are 
by  the  researcher.  When  these  observations  are  neatly  structured,  the 
problem  of  classification  has  been  solved.  However,  many  records  such 
as  case  records  in  psychological  clinics  or  social-work  agencies,  or  observa- 
t:ons  of  children,  or  protocols  of  free  answers  to  open-ended  questions, 
may  be  closer  to  the  "real"  attitudes  of  our  sample.  Qualitative  coding 
is  one  set  of  techniques  for  locating  and  specifying  the  order  which  exists 
in  such  materials,  and  to  which  one  theoretical  questions  point* 


CONTENT  ANALYSIS 

We  shall  not  at  this  point  attempt  to  introduce  the  substantive  field  of 
communication  analysis,  which  has  had  its  main  development  within 
the  past  generation.  All  that  we  wish  to  do  is  point  out  that  the  data 
from  all  fields  of  communication  may  also  be  treated  as  sociological  data. 
Whether  we  wish  to  understand  the  T'ang  Dynasty  in  China  or  the 
Protestant  Revolution,  the  propaganda  of  revolution  or  of  reaction,  the 
radio  programs  of  the  lower  class  housewife  or  of  the  highbrow,  the 
available  materials  must  be  somehow  coded.  Whether  these  materials 
support  or  weaken  a hypothesis  in  communication  theory  can  only  be 
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determined  by  the  usual  criteria  of  good  research  design.  All  that  is  at- 
tempted here  is  to  show  how  such  materials  can  be  exploited  as  observa- 
tions, so  that  their  underlying  order,  if  any,  is  clearly  demonstrated, 

A simple  case  from  the  field  of  communication  follows,  in  which  such 
coding  is  indispensable  for  content  analysis.  While  measuring  the  kinds 
of  stimuli  to  which  the  newspaper  public  was  exposed  during  a given 
period,  a researcher  may  come  to  the  conclusion  that  a particular  news- 
paper was  in  sympathy  with  the  Nazis  at  that  time,  A number  of  people 
may  object  to  this  statement,  pointing  out  that  the  owner  is  a respectable 
man  in  the  community,  that  he  is  “a  100  per  cent  American/'  that  he  does 
not  belong  to  any  political  party,  and  so  on. 

If  the  researcher  has  been  following  a merely  intuitive  analysis,  he 
would  have  to  argue  the  point  interminably,  If  he  cites  an  example, 
they  may  cite  a counter-example  or  an  explanation.  He  may  be  charged 
with  bias,  with  not  understanding  the  situation,  and  so  on.  Many  of  our 
daily  arguments  with  friends  are  similarly  inconclusive, 

However,  a good  sampling  plan  for  the  newspaper  issues  analyzed,  an 
adequate  logical  structure  uniting  the  data  and  the  conclusions,  and  a 
systematic  technique  for  coding  the  content  of  the  newspaper  make  it 
possible  to  provide  genuine  evidence,  such  as  Table  1 contains,  With 

TABLE  I 

Number  of  Statements  in  The  True  American  Consistent  with  or  Opposed  to 
Nazi  Propaganda  (March  3,  1943  to  December  31,  1943)  • 


Consistent 

Opposed 

The  U.S.  is  Internally  corrupt 

301 

34 

The  foreign  poPdes  of  the  U.S.  are  un-Christian 

41 

0 

The  President  of  the  U.S.  is  objectionable 

150 

17 

Great  Britain  is  internally  corrupt 

34 

0 

The  foreign  policies  ot  Great  Britain  are  uni-Christian 

80 

2 

Nazi  Germany  Is  just  and  virtuous 

56 

2 

Japan's  foreign  policies  are  morally  justifiable 

22 

6 

Japan  Is  powerful 

30 

2 

The  U.S.  is  weak 

1* 

41 

Nari  Germany  is  strong 
The  U.S.  and  the  world  are  menaced  ty; 
a.  Communists 

41 

2 

55 

0 

b.  Jews 

99 

0 

c.  Meddlers  In  the  affairs  of  other  countries 

45 

0 

* See  Harold  D.  Lass  veil,  Nathan  Leites,  ct  of..  Language  of  Politics  (New  Yoik: 
Stewart,  1949),  Chap.  9,  "Detection:  Propaganda  Detection  ar  d the  Courts/’  pp.  173-233, 
esp.  the  tab!e  on  p.  187.  Table  above  is  adapted  from  Lasswell’s  presentation. 

this  tabic  might  go  a comparison  with  other  publications,  such  as  the 
second  tabulation  here, 
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TABLE  2 


Percentage*  of  Anlde*  and  Editorials  Consistent  with  Nazi  Themes  • 


Magazine 


Percentage 


Time  Magazine 
True  American 
Liberty 

Saturday  Evening  Post 


0.11 

55.67 

1.12 

054 


• See  similar  tables  In  Harold  D.  Lass  well,  Nathan  Lelies,  ef  of.,  Language  of  Politics 
(New  Yorkf  Stewart,  1949),  pp.  19S-193.  This  table  is  an  example  of  Unwell’*  'cor* 
slstency  tesi”;  the  former,  o(  his  "parallel  test." 

Without  going  into  the  logic  of  the  research  design,  which  was  devel- 
oped during  the  war  and  used  for  actual  court  pioof,  we  can  see  that  such 
pinpoint  use  of  data  is  at  the  polar  extreme  from  an  intuitive  approach, 
it  will  be  noted  that  the  researchers  have  actually  cotinted  the  presence 
or  absence  of  Nan  themes  within  a publication.  Whethtr  the  focus  is  on 
editorials  alone,  or  on  the  entire  publication,  these  examples  show  a 
translation  from  the  qualitative  complexity  of  the  total  content,  to  the 
quantitative  simplicity  of  counting  Nari  themes.  Since  these  themes  are 
already  available,  the  coding  problem  would  not  be  difficult.  For  high 
reliability,  coding  instructions  would  simply  list  the  major  themes  to  be 
observed,  and  coding  the  publication  over  a selected  peiiod  would  yield 
an  accurate  count,  which  could  be  verified  by  other  tesearchers.  Of 
course,  not  all  the  material  from  this  publication  is  utilized.  Many  other 
analyses  could  be  made,  with  relevance  to  other  problems,  and  "theme" 
analysis  is  only  one  possible  unit  of  investigation. 

Let  us  now  select  another  example,  dealini;  with  the  status  of  various 
ethnic  groups  in  the  Urited  States,  as  depicted  in  popular  fiction.  A great 
number  of  interning  critical  essays  could  be  written  on  the  subject, 
depending  on  the  writer's  literary  knowledge.  These  might  be  suggestive 
and  stimulating,  but  most  of  them  are  open  to  the  criticism  that  the 
reader  "knows  of  exceptions"  or  he  "does  not  see"  how  the  wn'ter  ar- 
med at  such  conclusions.  By  developing  an  adequate  sampling  plan  and 
a set  of  clear  coding  opetations,  it  is  possible  to  demonstrate  a number 
of  facts  about  popular  fiction  whith  do  not  depend  on  the  researcher's 
personality  or  insight  alone  but  which  can  be  tested  by  any  trained 
social  scientist 

One  study  of  th:s  sotl  focused  on  the  treatment  of  ethnic  groups  in 
popular  short  stoiict.*  Tie  method  of  classifying  the  data  was  fairly 
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simple,  Each  story  has  a number  of  characters,  and  each  of  these  char* 
acters  is  described  more  or  less  fully.  The  student  can  extract  this  list  of 
characters  with  their  descriptions  Irom  each  story. 

Any  :uch  set  of  characters  can  then  be  analyzed,  in  order  to  discover 
whether  there  is  any  identification  of  class,  race  nhnic  origin,  religion, 
occupation,  the  goals  which  they  seek,  and  so  on.  The  process  of  develop- 
ing explicit  coding  operations,  as  noted  previously,  depends  on  the 
purpose  of  the  analysis  and  on  the  difficulty  of  securing  adequate  indi* 
cators  for  the  desired  classes.  For  the  case  in  question,  let  us  take  a few 
examples  of  hypothetical  characters  as  met  in  popular  fiction.  A simple 
decision  might  be  reached  on  the  following  individual,  who  is  presented 
with  the  description  used  in  the  story  about  him: 


Character 

How  Identified 

Treatment  in  the  Story 

John  Adams 

White  (story  Illustration) 
Descendant  of  New  England 
family 
1 awyer 

High  Income  (has  custom-built 
convertible,  seashore  summer 
cottage) 

Episcopal  (has  conversation 
with  minister  regarding  early 
church  experience) 

Hero  of  the  story  v 

Idealist  (is  trying  to  force  real- 
tors to  accept  a dly  plan 
which  will  beautify  the  com- 
munity, but  reduce  their  in* 
come) 

Similarly,  (he  following  character  seeim  adequately  identified  to  allow 
some  judgment  concerning  his  treatment  in  the  story: 

Character 

How  Identified 

Treatment  in  the  Story 

Johnny  Turtfdu 

White 

Came  to  this  country  from  Italy 
at  the  age  of  three  (father 
speaks  broken  English) 

High  Income  (spends  vacations 
at  expensive  resotu,  owns 
buildings) 

Catholic 

Minor  role.  He  ir  defeated  by 
hero 

Makes  money  from  numbers 
rackets 

Rejects  his  old  priest 

Cynktl  materialist  (expresses 
interest  onty  in  hi*  suctm; 
bribes  city  officials) 

Any  such  sketches  would,  of  course,  be  only  the  first  steps  toward 
more  precise  directions  for  identifying  the  ethnic  backgrounds  and  the 
roles  which  the  characters  play.  There  would  be  fewer  data  for  m*nor 
characters,  and  instructions  must  be  flexible  enough  to  allow  identifica- 
tion by  using  any  one  of  the  many  indicators  which  might  be  used. 
For  example:  (1)  the  ethnic  membership  might  be  specifically  noted; 
(2)  obvious  ethnic  names  might  be  used;  (S)  some  supposedly  ethnic 
phrases  may  occur;  (4)  membership  in  ethnic  organizations  may  be 
cited;  or  (5)  the  home  neighborhood  may  be  described  closely  enough  to 
allow  identification.  A similar  set  of  possibilities  exists  for  "treatment  in 
the  story,"  although  this  is  somewhat  more  difficult,  invoking  as  it  does 
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a set  of  cues  to  locate  and  specify  to  what  extent  th*re  is  "discrimina- 
tion" against  the  ethnic  group  number.  So  long,  however,  as  the  cate- 
gory, "unknown"  is  not  a large  one,  the  basic  conclusions  will  not  be 
affected, 

Note,  iurther,  that  for  many  minority  groups  there  may  be  no  repre- 
sentatives at  all.  This  is  not  surprising,  since  popular  fiction  must  rely 
upon  commonly  recognized  ethnic  images.  Consequently,  in  this  case 
the  researchers  used  these  categories:  "The  Americans,"  ’The  Anglo* 
Saxon  and  Nordic  minorities  and  foreigners,"  and  "The  Others"  (Jews, 
Italians,  Negroes,  etc.).  The  treatment  of  these  groups  in  the  fiction 
sample  was  then  tabulated  as  shown  in  the  accompanying  table.  "The 

TABLE  S 

A Content  Analysis  of  Ethnic  Groups  in  Current  American  Literature 


The 

Americans, 
Per  Cent 

the 

Anglo-Saxons 
and  wordics. 
Per  Cent 

The 
Others. 
Per  Cent 

Major  characters 

52 

50 

50 

Approved  characur* 

80 

n 

62 

Highest  socioeconomic  status 

50 

24 

16 

Top  occupations 

'‘Heart"  goali  (f.e.,  idealist,  lose,  family, 

59 

29 

20 

as  against  money,  po*er,  etc) 

69 

61 

49 

rule  seems  to  be  that  the  character  receives  better  treatment  the  closer 
he  is  to  the  norm  of  The  American,  i.e.,  white,  Protestant,  English  speak- 
ing, Anglo-Saxon"  (page  18G). 

Thus,  by  working  out  fairly  simple  rules  for  identification  and  treat- 
ment, it  was  possible  to  determine  that  within  popular  short  stories 
there  is  a demonstrable  "discrimination"  against  the  various  minority 
groups,  especially  those  which  are  at  present  disadvantaged  in  the  society. 
By  using  fairly  iarge  categories,  the  problem  of  decision  was  reduced 
considerably. 

The  data  contained  in  the  short  stories  were  not  originally  presented 
or  organized  for  the  purposes  of  research,  but  by  formulating  instruc- 
tions for  locating  and  identifying  the  characters  and  the  themes  which 
are  used  to  describe  them,  the  coders  could  reliably  classify  the  data  as 
presented.  The  further  analysis  of  the  meaning  of  these  data  will  always 
depend  on  the  goals  of  the  research,  but  the  data  ate  no  longer  guesses  or 
hunches:  they  are  empirically  verifiable  by  other  observers  if  the  same 
operations  are  followed. 

It  is  also  clear  that  there  are  many  wiiilr  of  the  communication  content 
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which  can  be  used  for  sampling  and  analysis,  from  small  units  such  as 
words  or  symbols,  to  journal  issues  or  volumes.  Similarly,  there  are  many 
ways  of  organizing  the  materials:  by  symbol  count,  by  item  analysis,  by 
thematic  analysis,  in  terms  of  the  structure  of  the  ideas  and  facts  pre- 
sented, or  even  (in  propaganda  materials)  in  terms  of  the  campaign, 
Clearly,  it  is  equally  possible,  in  the  case  of  content  addressed  to  an 
audience,  to  analyze  these  spoken  or  written  materials  in  terms  of  the 
response . However,  for  such  further  treatments  of  content  analysis  as  a 
substantive  field,  the  student  should  read  the  literature  died  at  the  end 
of  this  chapter. 

THE  CASE  STUDY 

The  case  study  approach  may  be  introduced  at  this  point,  since  tradi- 
tionally it  has  been  thought  of  as  a kind  of  qualitative  analysis.  More 
important  for  our  purposes,  however,  is  tie  fact  that  it  is  through  the 
developments  of  qualitative  techniques,  such  as  qualitative  coding,  that 
ihe  case  study  can  be  most  efficiently  used  in  social  research. 

References  in  the  sociological  literature  to  “the  case-study  method’1  are 
decreasing.  In  part,  this  would  seem  to  be  one  result  of  the  false  dis- 
tinction between  "the  statistical  approach,"  and  "the  nonstatistical  ap- 
proach," as  a consequence  of  which  the  case  study  sometimes  identified 
with  the  use  of  less  reliable  research  techniques.  It  is  often  thought  of 
as  a kind  of  intuitive  approach,  derived  from  much  participant  observa- 
tion and  using  all  sorts  of  personal  documents  such  as  diaries,  letters, 
autobiographies,  etc.,  without  adequate  sampling  design  or  checks  on  bias 
or  distortions  resulting  from  personal  views  of  social  reality. 

However,  such  an  identification  fails  to  distinguish  the  case-study 
approach,  a method  of  looking  at  social  reality,  from  the  specific  research 
techniques  which  accompanied  it  in  the  research  of  a generation  ago.  It 
is  true  that  much  of  case-study  research  at  that  time  failed  to  follow 
good  sampling  design  and  often  came  to  conclusions  with  no  explicit 
description  of  the  operations  which  led  to  those  conclusions.  However, 
this  was  true  for  most  social  research,  and  in  any  event  there  is  no  in- 
trinsic connection  between  the  case  study  and  unsophisticated  research 
techniques. 

Similarly,  the  cue  study  approach  has  bc*n  praised  as  a set  of  useful 
research  techniques,  such  as  the  use  of  personal  documents,  probing  many 
facets  o!  the  respondent1!  life,  adjusting  the  wording  of  questions  to  fit 
the  peculiar  experience  of  the  respondent,  sharing  experiences  with  the 
individuals  being  studied,  collecting  life  histories,  gathering  data  over 
the  lifetime  of  the  individual,  etc.  Such  praise  is  equally  unwarranted, 
since  these  techniques  may  be  used  in  almost  any  type  of  social  research. 
The  use  of  qualitative  coding  permits  the  use  of  personal  documents  in 
a systematic  way.  The  interview  guide  is  one  form  of  the  questionnaire 
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which  allows  flexible  and  alternative  wordings  ol  questions,  when  that 
is  needed.  Participant  obsetvation,  while  it  has  not  been  systematically 
studied  in  recent  years,  is  used  when  possible  in  the  early  stages  of 
formulating  the  research  problem.  Life  histoiies  may  also  be  used  in 
the  preliminary  stages  of  tesearch,  or  as  the  basic  body  of  data,  and  may 
be  employed  in  conjunction  with  the  techniques  of  qualitative  coding 
for  many  types  of  social  research.  In  short,  modern  social  research  has 
attempted  gradually  to  systematize  and  make  precise  the  various  special 
“qualitative”  techniques  which  were  once  thought  to  be  the  exclusive 
charvteristics  of  the  case-study  approach. 

The  case  study,  then,  is  not  a specific  technique.  It  is  a way  of  or- 
ganizing social  data  so  as  to  preserve  the  umtaty  character  of  (he  social 
object  being  studied.  Expressed  somewhat  differently,  it  is  an  approach 
which  views  any  social  unit  ar  a whole.  Almost  always,  this  means  of 
approach  includes  the  develofment  of  that  unit,  which  may  be  a per- 
son, a famiiy  or  other  social  group,  a set  of  relationships  or  processes 
(such  as  family  crises,  adjustment  to  disease,  friendship  formation,  ethnic 
invasion  of  a neighborhood,  etc.),  or  even  an  entire  culture. 

Belore  pointing  out  some  of  the  difficulties  and  advantages  of  this 
approach,  let  us  clarify  it  somewhat  by  contrasting  this  holistic  view 
with  that  of  much  modem  research.  A simple  case  taken  from  public 
opinion  polling  yields  an  extreme  comparison.  Suppose  the  respondents 
are  asked  a series  of  questions  about  their  attitudes  toward  war.  For 
further  analysis,  a series  of  cross-tabulation  questions,  such  as  age,  sex, 
economic  ranking,  urb.nrvral  residence,  and  veteran  status,  may  also 
be  asked.  The  answer*  to  eath  question  are  classified,  in  ordtr  to  cross- 
tabulate  by  strata.  Thus  the  answers  to  the  question  on  economic  ranking 
are  put  into  the  classes  A,  B,  C,  and  D.  Perhaps  three  to  five  age  groupings 
and  two  sex  groupings  (male  and  female)  may  also  be  used.  Having  thus 
classified  and  coded  the  data,  all  these  can  be  transferred  to  the  punch 
cards. 

However,  from  this  point  onward  the  individual  unit,  the  person,  dis- 
appears from  the  analysis,  which  instead  merely  compares  traits.  Thus, 
for  example,  cross  tabulations  may  reveal  whether  the  old  object  to  war 
more  vigorously  than  do  the  young,  or  whether  women  disapprove  of  war 
more  vigorously  than  do  men,  or  perhaps  how  the  various  economic  rank- 
ings compare  in  their  attitudes  toward  war. 

The  answer  to  each  of  these  questions  is  obtained  by  sorting  out  all 
the  cards  into  the  first  group  of  trails  (old  at  against  young),  then  tabu- 
lating under  each  group  the  responses  to  questions  about  war.  For 
example,  the  cards  could  be  sorted  into  those  with  a punch  meaning  “65 
and  over,”  one  which  means  "under  20,”  and  another  group  which  in- 
cludes all  the  rest  (or  any  other  age  classification  desired).  Then,  for  eath 
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such  group  obtained  in  this  fashion,  the  responses  to  the  questions  con- 
cerning war  are  tabulated. 

Note,  however,  that  it  is  only  the  trail*  which  are  immediately  in- 
volved in  the  cross  tabulation.  All  other  characteristics  on  each  card  are 
ignored.  What  is  being  emphasized  here  is  not  the  narrowness  of  the 
questions,  for  even  the  most  elaborate  research  represents  a relatively 
thin  slice  of  any  respondent’s  life.  Rather,  it  is  important  to  see  that 
most  cf  our  analyses  are  cross-tabulating  the  distribution  of  one  /raft 
with  the  distribution  of  another.  This  is  equally  the  case  for  elaborate 
cross  tabulations,  as  it  is  for  correlations.  The  individual  representing 
the  "wholeness”  of  those  traits  does  not  figure  in  these  analyses. 

Such  a characterization  is,  of  course,  essentially  negative  and  seems 
almost  a polemic  cliche.  In  order,  then,  to  specify  the  case-study  approach 
as  an  alternative  mode  of  handling  respondents,  groups,  or  social  rela- 
tionships, let  us  attempt  to  show  wherein  the  differences  lie.  In  a more 
practical  form,  how  can  the  wholeness  of  cases  be  preserved?  We  shall 
attempt  to  outline  the  answer  briefly  under  four  nther  complex  rubrics: 
(1)  breadth  of  data;  (2)  levels  of  data;  (3)  formation  of  indexes  and  types; 
and  (4)  interaction  in  a time  dimension. 

A few  preliminary  remarks  are  in  order,  however.  First,  the  wholeness 
of  any  object,  whether  physical,  biological,  or  social,  h an  inteUec.uat 
construct.  Concretely,  there  are  no  limits  which  define  any  process  or 
object.  Every  variable  ultimately  links  with  any  other.  As  theoretical 
biology  has  pointed  out,  from  some  points  of  view  even  the  living  animal 
is  a construct,  and  the  point  at  which  the  animal  stops  and  "the  environ- 
ment” begins  is  arbitrarily  defined.  Similarly,  "the  individual,"  or  "the 
formation  of  a gang"  must  be  such  a construct,  defined  in  terms  that 
are  most  fruitful  for  the  research  problem. 

From  this  point  we  see,  further,  that  not  oniy  is  it  difficult  to  set  the 
ffmifj  of  any  social  object,  but  it  is  difficult  to  state  at  what  point  it  is 
profitable  to  stop  gathering  drta  about  the  object  so  delimited.  As  has 
been  noted  before,  an  infinite  number  of  observations  may  be  made 
about  even  the  dullest  or  most  unimportant  person  Consequently,  neither 
the  case  uudy  nor  any  other  approach  can  be  characterized  as  the  analysis 
of  the  individual  in  all  his  uniqueness.  The  point  at  which  data  are 
adequate  must  be  determined  by  the  research  problem  itself.  There  is 
no  inherent  or  intrinsic  limit.  One  implication  of  this  must  be  added: 
when  we  speak  of  the  "social  object  as  a whole"  w*e  cannot  mean  the 
"social  object  as  a unique  unit."  It  is  precisely  the  unique  elements  in 
any  phenomenon  which  are  eliminated  by  scientific  abstraction.  To  the 
extent  that  characteristics  are  really  unique,  it  is  impossible  to  account 
for  them  in  terms  of  scientific  generalization.-  Consequently,  the  case- 
study  method  cannot  be  thought  of  as  capturing  the  unique,  but  as  at* 
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tempting  to  keep  together,  as  a unit,  those  characteristics  which  are  rele- 
vant to  the  scientific  problem  being  investigated. 

Breadth  of  data.  Clearly,  one  ol  the  ways  by  which  the  unit  is  studied 
as  a totality  is  the  collection  of  a broad  array  of  data  about  it.  This  is  true 
whether  the  subject  is  a set  of  social  relationships,  a person,  or  a group. 
The  "person"  who  is  recorded  on  a polling  schedule  is  not  on1./  dis- 
solved by  becoming  a set  of  trails  individually  tabulated;  he  almost  fails 
to  exist  from  the  beginning,  by  \irtue  of  the  narrow  range  of  the  data 
concerning  him.  Although  mere  quantity  of  data  is  not  sufficient,  since 
the  collection  must  be  guided  by  the  research  problem,  there  is  a greater 
opportunity  to  grasp  the  pattern  of  the  individual's  life  if  a substantial 
body  of  data  concerning  many  facets  of  that  life  is  available. 

Levels  of  data . The  case-study  method  is  further  distinguished  by  the 
use  of  data  from  other  abstract  levels  than  the  purely  sociological.  Al- 
though many  sociologists  do  incorporate  economic,  political,  psycho- 
dynamic, and  even  biological  data  into  their  analyses,  social  research  has 
in  the  main  been  moving  in  the  direction  of  a clearer  definition  of  the 
sociological,  together  with  better  techniques  for  abstracting  it  from  the 
complex  of  other  relationships.  In  the  case-study  method,  such  data  from 
other  levels  are  deliberately  incorporated  into  the  records.  Although  it 
may  be  pointed  out  that  such  data  should  be  viewed  in  terms  of  their 
sociological  meaning,  so  that  a biological  defect,  fov  example,  is  seen  as 
it  affect*  social  relationships,  it  is  clear  that  recourse  to  other  levels  docs 
give  added  dimensions  to  the  individual  being  studied.  When  we  see 
the  individual  in  his  total  network  of  relationships,  it  is  more  difficult 
to  lose  sight  of  him  as  a unit. 

Formation  of  indices  and  types.  However,  the  most  important  tech- 
nique in  preserving  (he  wholeness  of  the  social  unit  is  the  development  of 
typologies  and  indexes,  so  that  the  various  traits  are  actually  used  in 
characterizing  the  unit.  Note  that  this  technique  is  not  confined  to  the 
case-study  method  and  is  used  in  any  qualitative  analysis.  Furthermore, 
elaborate  cross  tabulations  attempt  the  same  goal.  Thus,  cross  tabula* 
lions  in  terms  of  sex,  age,  and  socioeconomic  ranking  serve  to  character- 
ize those  who  are  (for  example)  male,  and  aged  20  to  2y,  and  poor. 

The  same  operations  may  be  performed  in  a case  study,  and  tbe 
problem  may  be  somewhat  simplified  by  qualitative  coding  beforehand. 
Because  of  the  breadth  and  added  dimensions  of  the  data,  moreover,  the 
choice  of  which  type  or  class  the  case  falls  into  will  be  simplified.  In 
addition,  by  the  time  the  types  begin  to  emerge  clearly,  enough  of  the 
process  of  interaction  between  various  facets  of  the  individual*!  life  will 
be  known  so  that  this  choice  may  be  made  with  greater  certainty.  Finally, 
cases  will  be  concerned  with  developing  '’natural  types,"  that  is,  types 
which  are  already  discerned  in  the  culture  (such  as  "lovers'  quarrels,"  'life 
of  the  party,"  "village  philosopher,"  ot  the  religious  revival  meuing). 
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and  the  case-study  approach  can  ordinarily  single  out  such  types  at  an 
early  stage. 

Interaction  in  a time  dimension.  To  the  breadth  and  additional  levels 
of  the  data  gathered  in  the  case-study  method  must  be  added  the  em- 
phasis upon  process  and  time . Any  cross  tabulation  is  essentially  a static 
analysis,  although  the  relative  numbers  of  cases  found  in  various  subcells 
may  be  used  to  test  hypotheses  of  interaction.  For  example,  according  to 
some  theories  we  would  expect  to  find  a causal  relationship  between  I 
class  position  and  continuation  in  school.  It  is  possible,  then,  to  cross- 
tabulate  these  characteristics  (possibly  holding  IQ  constant)  and  thus 
check  the  hypothesis. 

Nevertheless,  we  do  not  observe  "process”  in  such  tabulations.  We  are 
limited  to  the  traits  on  the  punch  card,  which  have  not  recorded  any  of 
their  relationships  to  one  another.  In  the  case  study,  the  attempt  to  hold 
these  characteristics  together  in  both  the  data  gathering  and  the  data- 
analysis  phases  emphasizes  the  changes  in  time,  as  well  as  the  processes 
by  which  those  changes  took  place.  The  period  of  time  may  be  short  or 
long.  An  individual's  life  from  childhood  until  the  time  of  a study,  or 
the  formation  and  reformation  of  cliques  in  Congress  over  a very  short 
period  of  months  may  be  studied  equally  well.  In  either  case,  however,  j 
the  concern  is  with  recording  the  relevant  characteristics  as  they  appear 
in  interaction,  not  merely  recording  them  at  two  separate  instants  in 
lime  for  a before-and-after  comparison.  The  emphasis  on  the  interaction 
process  makes  the  maintenance  of  the  unitary  character  of  the  social  ob-  , 
ject  somewhat  simpler.  j 

Prohtems  of  the  case  study  technique.  Most  of  the  ditficulties  in  the 
use  of  this  method  can  be  reduced  to  one,  although  a motv  complex  classi- 
fication is  possible.  Interestingly  enough,  the  basic  danger  in  its  use  is  the 
response  of  the  researcher . The  researcher  comes  to  feel  a false  sense  of 
certainty  about  his  own  conclusions.  The  danger,  then,  does  not  lie  in  i 
any  technical  weakness  of  this  approach  to  social  processes  or  individuals  j 
as  wholes. 

The  student  may  feci  that  this  is  a constant  danger  in  all  research.  Yet 
it  should  be  remembered  that  in  most  research  there  are  constant  re- 
minders that  a my  narrow  range  of  experience  is  represented  by  the 
data.  However  long  our  interviews,  gaps  are  all  too  obvious,  aod  there  is 
noway  to  fill  them.  However,  each  ease  which  is  developed  as  a unit  takes  j 
on  complete  dimensions  in  the  mind  of  the  researcher,  He  comes  to  feel 
certain  that  he  could  answer  many  more  questions  about  his  case  than 
can  be  answered  from  his  file  records.  The  case  has  a definite  form  and 
pattern,  and  as  the  researcher  probes  more  deeply  into  the  process  or  * 

person  being  studied,  he  finds  there  are  few  surprises  left.  It  is  quite  1 

comparable  to  our  feeling  of  certainty  about  our  close  friends,  or  the  j 
neighborhood  in  which  we  have  lived  for  years,  or  our  families.  There  is,  i 
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in  short,  an  emotional  feeling  of  certainty  which  is  much  strongc  • than 
in  the  case  of  ether  types  of  research.  This  is  particularly  true  as  com- 
pared with  much  survey  work,  in  which  the  analyst  has  only  the  com- 
pleted schedules  before  him  and  knows  he  cannot  capture  the  • iried 
experiences  of  the  many  interviewers  who  carried  out  those  interviews. 

Yet  the  student  can  test  the  accuracy  of  this  knowledge  very  easily, 
by  submitting  to  questioning  about  his  close  friends  or  his  own  family, 
the  "cases”  he  knows  best.  Most  of  us  find  that  our  feelings  of  certainty 
deceive  us.  We  know  far  fewer  data  about,  thesi  cases  than  we  believe. 
Important  facts  are  forgotten.  Other  facts  are  distorted,  or  surprising 
facets  of  these  lives  are  unknown  to  us.  For  example,  children  usually 
believe  that  they  "understand”  their  parents,  but  are  often  oblivious  to 
the  many  deep  conflicts  between  them.  Brothers  and  sisters  are  often 
unaware  of  important  experiences  happening  to  each  other  outside  the 
family  circle.  What  most  of  us  Icel  is  a kind  of  athomeness  with  these 
close  acquaintances,  so  that  we  fill  in  or  ignore  those  facts  or  dimensions 
of  the  person  which  are  really  unknown  to  us. 

This  danger,  then,  is  one  which  the  observer  himtelf  creates.  The 
consequences  of  this  feeling  of  certainty  are  many,  but  most  of  them 
can  be  grouped  under  one  main  heading:  a temptation  to  ignore  basic 
principles  of  research  design.  Since  the  researcher  feels  so  very  certain 
about  the  area  of  experience  he  is  investigating,  he  feels  no  need  to  check 
the  over  all  design  of  proof.  For  example,  after  the  student  has  collected, 
i say,  200  cases  of  juvenile  delinquency  from  *ocial-work  records,  supple- 
I mented  by  inlet  viewing  and  other  sources,  it  is  difficult  for  him  to  feel 
1 that  he  does  not  have  an  adequate  sample.  The  tange  of  delinquency 
j experience  in  his  cases  is  so  wide,  the  types  of  people  so  varied,  the  depth 
| of  detail  so  vivid,  that  the  researcher  ordinarily  has  a strong  conviction 
that  his  selection  is  "representative.”  He  was  not,  usually,  following  any 
known  pattern  of  sampling,  so  that  he  feels  certain  there  is  no  bias.  It 
must  be  repeated  at  this  point  that  we  are  not  separating  one  type  of 
researcher  from  any  other  type,  dny  investigator  who  absorbs  the  facts 
from  a large  number  of  cases  will  begin  to  feel  that  he  really  has  a satis- 
1 factory  sample,  no  matter  how  much  knowledge  he  has  about  sampling 
| design. 

The  result  is,  naturally,  a strong  temptation  to  extrapolate  unwar- 
rantedly.  Perhaps  an  equally  important  consequence  of  this  feeling  is 
the  failure  to  make  explicit  just  what  are  the  generaliraiiont  underlying 
the  analysis  of  the  cases.  One  boy’s  refusal  to  give  up  his  pilfering  may 
be  explained  by  his  desire  for  economic  gain.  Yet,  in  a similar  case,  and 
» without  explaining  the  bares  for  the  different  explanation,  another  boy's 
j refusal  to  give  up  thievery  may  be  explained  by  his  desire  to  humble  bis 
parents,  a psychodynamic  explanation.  Although  both  explanations  may 
be  correct,  we  are  likely  to  forget  that  any  such  analysis  requires  a system 
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of  explanation,  It  is  not  scientific  to  apply  various  explanations  which  by 
common  sense  or  intuition  seem  to  fit  particular  cases,  on  a purely  ad  hoc 
basis.  Some  set  of  generalization*  is  being  used,  and  it  is  necessary  to 
make  explicit  just  why  one  or  the  other  may  or  may  not  be  applied.  How- 
ever, when  the  investigator  feels  so  much  "at  home"  with  each  case,  he 
may  be  tempted  to  "feel"  the  right  explanation-even  though  a second 
reader  of  our  cases  might  come  to  an  entirely  different  explanation.  This 
is  what  is  known  as  cd  hoc  theorizing.  The  previous  reference  suggests 
a further  danger  resulting  from  this  feeling  of  certainly:  the  failure  to 
test  the  reliability  of  the  data  recorded,  the  classifications  used,  or  th$ 
analysis  of  the  data.  As  the  researcher  is  likely  to  point  out,  no  one  else 
knows  the  data  nearly  so  well  as  he,  so  that  no  one  else  could  check  his 
work  adequately.  Furthermore,  case  gathering  U a time-consuming  activ- 
ity, and  it  is  difficult  to  find  others  who  are  willing  to  study  the  cases  with 
such  completeness. 

Avoiding  these  problems . Nevertheless,  these  difficulties  can  be  avoided 
by  the  student  who  is  willing  to  follow  f x>d  research  planning.  Being 
warned  of  the  dangers  which  result  from  * ach  a feeling  of  certainty, 
he  will  develop  a research  design  which  takes  account  of  these  dangers. 
He  cannot  avoid  the  feeling  ot  certainty,  but  he  can  use  an  adequate 
sampling  pattern.  Knowing  that  his  sampling  is  good,  he  has  a rational 
basis  for  making  estimates  about  the  universe  from  which  it  was  drawn. 
Instead  of  relying  on  intuition  for  his  conclusions,  he  will  make  cettain 
that  whenever  he  analyzes  a given  factor,  there  are  actual  data  for  all 
case  records  concerning  that  factor.  Further,  he  will  develop  his  theoreti- 
cal framework  from  the  beginning  of  the  research,  so  that  he  avoids 
speculations  made  on  the  spot  to  fit  the  peculiarities  of  each  case. 

Furthermore,  he  will  attempt,  as  far  as  |x>ssible,  to  utilize  the  technique 
of  qualitative  coding  for  individual  factors  and  traits  which  are  amenable 
to  such  classifications.  If  he  is  to  use  categories  like  "selfish,"  or  "ad- 
justed," or  "anomic,"  he  will  develop  a set  of  instructions  for  deciding 
whether  a given  case  fells  into  the  category.  If  other  investigators  cannot 
use  such  instructions,  they  are  likely  to  contain  many  implicit  judg- 
ments, intuitions,  common  sense  guesses,  and  the  like.  The  good  student 
will  recognize  that  a description  of  research  operations  must  be  so  written 
that  other  scientists  can  repeat  them.  If  not,  e-ther  the  description  or  the 
operations  are  unclear.  The  student  must  make  this  test,  if  he  is  not  to 
remain  open  to  the  charge  from  others  that  they  "simply  can  t see"  what 
he  sees.  It  is  usually  good  procedure  to  carry  out  this  stage  of  analysis 
with  the  aid  of  a number  of  collaborators  or  assistants  who  can  act  as 
judges  of  the  reliability  of  even  simple  classifications. 

One  may  almost  claim  that  it  is  at  this  point-that  of  developing  cate- 
goric*, of  defining  and  delimiting  types  of  behavior  already  reoorded- 
that  the  case  study  requires  the  closest  criticism.  For  it  is  at  this  stage 
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that  the  systematisation  of  much  of  the  mass  of  data  begins,  that  Is,  con- 
densation, excision,  reinterpretation,  etc.  The  final  report  cannot  simply 
repeat  the  recorded  observations.  The  steps  in  this  transformation,  then, 
must  be  carefully  marked,  if  the  researcher  expects  to  hav^  his  conclu- 
sions accepted. 

It  is  worth  recording  that  this  problem  is  not  peculiar  to  sociology. 
When  the  historian  begins  to  reduce  the  mass  of  individual  observations 
from  original  sources,  he  faces  the  problem  of  showing  his  operations  to 
fellow  historians,  so  that  they  can  follow  them  exactly,  should  they  wish 
to  do  so.  Much  of  the  objection  of  social  scientists  to  the  published 
analyses  of  psychiatrists  and  psychoanalysts  is  simply  that,  whatever  the 
details  of  the  case,  there  is  no  clearly  outlined  set  of  operations  by  which 
the  next  scientist  can  use  the  same  data  to  come  to  the  same  conclusions. 
The  economist  who  studies  i)  < marketing  process  among  tobacco  farmers 
may  find  similarly  that  each  case  it  so  very  different  that  colleagues  are 
suspicious  of  condjsions  until  the  operations  of  the  analyst;  are  made 
clear.  As  noted  above,  the  best  way  of  clarifying  these  steps  is  to  develop 
definite  coding  imttuctions  for  most  of  the  important  qualitative  items, 
with  definite  tests  for  coding  reliability. 

Costs  in  time  and  money.  It  is  clear  thal  this  approach  is  a costly  one 
in  time  and  money.  Each  case  becomes  a research  in  itself,  and  the  colleo 
tion  of  even  100  adequately  documented  cases  may  easily  consume  2 
years  of  the  student’s  time.  Since  the  data  are  not  usually  standardised, 
the  system  of  keeping  records  and  of  developing  techniques  for  standard- 
iiing  the  observations  must  be  rather  elaborate.  Over  such  a lengthy 
period,  the  loss  of  potential  cases  from  the  sample  drawn  may  be  very 
great,  and  the  resultant  costs  of  tracing  cases  similarly  great. 

These  costs  need  not  deter  the  researcher  from  this  technique.  They 
simply  remind  hitn  of  a cardinal  fact,  thal  all  research  is  expensive,  and 
there  must  be  an  earl)  judgment  as  to  which  kind  of  study  will  be  most 
adequate  for  the  kinds  of  facts  needed.  For  some  types  of  analysis,  the  use 
of  case  studies  will  be  fruitful.  For  others,  a simple  polling  study  or 
mailed  questionnaire  may  be  adequate.  When  detailed  processes  of  social 
interaction  arc  being  analysed,  some  preliminary  recourse  to  case  study 
is  needed,  even  if  there  is  not  complete  reliance  upon  this  method.  One 
type  of  simple,  less  expensive,  substitute  is  a panel  sample,  which  is  re- 
interviewed over  a petiod  of  months. 

Adi'antagcs  of  case  study.  Nevertheless,  as  was  noted  previously,  most 
good  research  has  some  recourse  to  case  analysis,  even  if  it  docs  not  appear 
in  the  final  publication  of  results.  Its  fruitfulness  is  indicated  by  the 
preceding  discussion  of  its  characteristics.  Before  a final  questionnaire 
is  developed,  it  is  likely  that  the  cartful  researcher  will  attempt  the 
j analysis  of  many  cases  which  center  on  his  major  problem.  He  cannot 
I rely  on  the  reports  from  a few  trial  interviews.  He  will  want  to  go  over 
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the  field  of  interaction  with  a number  of  respondents,  in  great  detail,  to 
iind  out  just  how  their  life  pattern  is  being  affected  by  the  item  being 
studied*  For  example,  if  the  aim  is  to  develop  a good  schedule  for  the 
analysis  of  stratification,  simply  trying  out  a number  of  schedule  drafts 
will  usually  not  be  adequate.  The  student  must  attempt  to  develop  a 
body  of  much  more  intensive,  detailed,  and  subtle  observations  about 
many  individuals,  learning  about  their  responses  to  the  phenomena  of 
class  barriers,  mobility,  class  perspectives,  etc.  In  addition,  it  is  here  that 
the  added  data  allow  the  study  of  the  deviant  cases,  that  is,  those  which 
seem  to  negate  the  theory.  Such  analysis  may  lead  to  considerable  clari- 
fication of  the  theory  itself.  Such  observations  may  or  may  not  be  forma  V 
i?ed  into  definite  case  histories.  Such  a practice  is  recommended,  since  at 
this  exploratory  stage  the  student  is  likely  to  see  selectively,  that  is,  see 
those  items  which  seem  outstanding  to  him,  while  neglecting  perhaps  . 
equally  important  data.  Only  by  recording  them  for  analysis  and  then 
analyzing  them,  prior  to  the  next  stage  of  formulating  the  final  version  of  I 
questionnaire  or  schedule,  can  he  make  certain  that  all  the  pertinent  data  I 
are  being  incorporated  into  the  research  instrument.  j 

Of  course,  the  researcher  may  be  interested  in  certain  social  processes,  j 
such  as  the  courtship  process,  clique  formation,  posidivortc  adjustment, 
or  boys*  gangs,  ind  in  such  a rase  the  unit  selected  need  not  be  the  life 
history  of  the  individual  person.  It  will  instead  be  the  process  selected 
for  study.  Here  again  the  preliminary  study  of  the  processes  is  greatly 
facilitated  by  case  documents,  incorporating  the  pertinent  data  which 
will  guide  later  stages  of  the  research. 

Whethet  the  student  approaches  his  data  through  the  framework  of 
the  case  study  at  later  stages  of  the  investigation  will  depend  on  many 
factors.  As  was  noted  above,  the  time  and  money  at  bis  disposal  may  lead 
to  a decision  to  study  only  a limited  number  of  factors.  In  this  case,  he 
may  well  utilize  a questionnaiie  or  some  form  of  single  interview  as  his 
main  technique.  If  he  wishes  to  study  a much  wider  tange  of  data,  he  may 
be  willing  to  limit  the  number  of  cases.  He  will  ordinarily  not  be  sble  to 
generalize  safely  from  a small  number  of  cases,  but  it  is  often  true  that  the 
depth  of  insight  afforded  by  case  study  will  yield  fruitful  hjpotheses  [ 
for  a later,  full  scale  study. 

The  most  important  theoretical  advantage  of  the  rase  study  has  not 
been  fully  exploited  as  yet.  As  noted  earlier,  most  research  has  become 
"trait”  research,  that  is,  cross  tabulations  letween  traits  of  individuals. 
Even  when  these  individuals  are  being  studied  in  groups,  very  few  indi- 
viduals or  groups  as  units  are  being  analyzed,1  Even  though  many  of 
the  apparently  group  trails  tan  be  approached  through  combining  indi 

• for  a recent  diatwsioo,  set  Patrida  L.  Kendall  and  Paul  F.  Laursfek),  "Problems  of 
Surrey  Analysis,*’  ConfinwmVi  in  Sodaf  Restsnh,  Robert  R.  Merton  and  Paul  F.  La  ran 
fdd,  eds.  (Glencoe,  111.:  Free  Press,  I«0),  pp  ISM*. 
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vidual  data,  they  may  alto  be  approached  through  an  organizing  frame- 
work emphasizing  the  characteristics  of  the  group,  or  process,  or  social 
structure  as  a whole.  The  case  study,  attempting  to  organize  data  around 
the  unit  of  growth,  or  group  structure,  or  individual  life  pattern,  does 
force  the  researcher  to  think  in  these  terms  rather  than  fall  back  on  trait 
analysis  alone. 

Emphasis  should  also  be  placed  on  the  wider  range  of  personal  experi- 
ence which  the  use  of  the  case  study  gives  to  the  student.  This  advantage 
has  been  implicit  in  the  previous  discussion,  but  it  deserves  special  com- 
ment at  thb  point.  Precisely  because  of  the  narrowness  of  most  survey 
work,  the  researcher  actually  derives  most  of  his  wider  range  of  experience 
in  such  investigations  *>t  the  stage  of  analysis,  when  the  meaning  of  the 
questions  is  probed  more  deeply.  This  latter  stage  is  most  fruitful,  how- 
ever, if  there  has  been  a prior  period  of  absorbing  the  varied  experience 
of  others.  The  case  study  is  particularly  useful  because  of  its  attempt  to 
f:nd  the  meaning  of  the  recorded  data  within  the  life  of  the  individual, 
and  only  later  in  terms  of  classes  of  individuals.  Often,  too,  these  ex- 
periences yield  new  insights  because  of  their  very  differences  from  the 
ordinary  experiences  of  the  researcher;  he  may  never  have  been  divorced, 
been  a safecracker,  a pickpocket,  a member  of  a boys*  gang,  a member  of 
an  ethnic  group  becoming  Americanized,  etc.  In  a sense,  then,  he  is  able 
y ereby  to  live  many  quite  different  lives  by  sharing  these  experiences. 
Such  materials  are  not  only  useful  within  the  confines  of  his  particular 
research,  but  may  become  raw  materials  for  further  reflection  and  re- 
search. Since  the  researcher,  whether  academic,  industrial,  or  govern- 
mental, is  under  pressure  to  lead  his  life  entirely  within  his  own  middle- 
class  group,  attention  to  these  further  dimensions  of  social  life  prevents 
his  sociological  thinking  from  becoming  increasingly  narrow. 

It  is  seen,  then,  that  although  it  is  not  possible  to  identify  the  case- 
study  "method”  as  a particular  technique  for  eliciting  data,  it  is  a mode 
of  organizing  data  in  terms  of  some  chosen  unit,  such  as  the  individual 
life  history,  the  history  of  a group,  or  some  delimited  social  process.  In 
order  to  obtain  such  holistic  data,  one  may  use  all  the  techniques  which 
any  other  mode  of  organization  uses:  intensive  interviews,  questionnaires, 
self-histories,  documents,  case  reports  by  others,  letters,  etc.  Maintenance 
of  the  unitary  character  of  the  case  is  aided  by  the  breadth  and  added 
levels  of  data  gathered,  the  use  of  indexes  and  typologies,  and  the  empha- 
i sis  on  interaction  in  a time  dimension.  There  is,  then,  some  attempt  to 
I make  of  each  case  a research  in  itself.  This  is  a time-consuming  process, 

i and  the  collection  of  a large  number  of  cases  may  be  unnecessary,  if  the 

\ research  goals  are  limited.  The  absorption  in  the  detailed  material  of 
social  relationships  and  interaction  gives  the  researcher  a wider  range 
and  a greater  depth  of  experience,  which  may  be  of  great  use  in  interpret- 
ing the  data  and  in  further  research.*  However,  by  such  close  contact 
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with  the  individuals  or  groups  studied,  it  is  likely  that  the  observer  will 
have  a strong  effect  on  the  data  themselves.  At  least,  it  may  be  claimed 
that  this  danger  is  perhaps  greater  in  this  approach.  The  absorption  in 
the  data  also  creates  such  a feeling  of  certainty  about  hjs  knowledge,  on 
the  part  of  the  researcher,  that  he  is  often  tempted  to  be  less  cautious  in 
following  the  basic  rules  of  research  design.  Nevertheless,  for  preliminary 
research  in  any  field,  most  investigators  will  use  some  form  of  the  case 
study;  and  for  the  purpose  of  group  or  process  analysis,  as  against  the 
analysis  of  individual  traits  alone,  it  is  a highly  fruitful  approach,  as  yet 
insufficiently  exploited  by  those  who  are  currently  doing  research  into 
research  techniques. 
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Thus  far  the  emphasis  has  been  on  the  necessity  of  carefully  planning 
the  collection  of  data  so  that  significant  questions  can  be  answered  and 
on  the  problems  of  collecting  data  so  that  the  answers  will  be  reliable 
and  valid.  It  is  quite  obvious,  however,  that  performing  these  two  steps 
alone  will  not  provide  an  answer  to  the  original  hypothesis.  Rather, 
they  will  provide  a means  for  answering  it.  This  chapter  aims  at  clarify- 
ing  some  of  the  ways  in  which  such  data  can  be  so  ordered  as  to  provide 
the  desired  answers. 

Under  ideal  conditions  of  precision  and  simplicity  this  presents  very 
few  problems  since  the  statement  of  the  hypothesis  and  the  elaboration 
of  the  experimental  design  will  automatically  provide  for  the  analysis 
of  the  data.  Consider,  for  example,  a hypothesis  stated  as  follows:  If 
lights  are  selected  as  visual  stimuli  in  such  a way  that  each  is  subliminal 
(not  perceivable)  to  each  eye  taken  separately,  then  if  they  are  applied 
simultaneously  (but  still  separately)  they  will  be  perceived  by  the  subject. 
Such  a hypothesis  is  so  stated  that  either  there  will  be  perception  or 
there  will  not.  An  analysis  of  the  data  would  constitute  no  problem  at  all. 
If,  on  the  other  hand,  the  responses  were  such  as  to  show  not  an  absolute 
presence  or  absence  but  rather  proportions  or  degrees  of  presence,  the 
problem  of  analysis  would  become  more  complex.  If,  in  addition,  the 
proportions  of  those  perceiving  the  light  appears  to  be  different  among 
different  types  of  subjects,  e.g.,  men  and  women,  or  older  and  younger 
persons,  the  problem  becomes  rapidly  still  more  complex. 

In  other  words,  the  problems  raised  by  the  analysis  of  data  are  directly 
related  to  the  complexity  of  the  hypothesis  or  hypotheses.  Suppose,  for 
example,  a study  is  carried  on  to  test  the  hypothesis  that  family  size  is 
directly  related  to  size  of  the  home  in  which  the  family  resides,  with  the 
results  shown  in  the  table  which  follows.  Such  findings  would  force  a 
rejection  of  the  hypothesis  as  stated  and  should  lead  to  its  reformulation 
i,*  more  specific,  and  hence  more  complex,  terms.  Thus,  it  might  be 
restated  as  follows:  Th?  size  of  the  home  is  positively  correlated  with 
family  size,  xvhen  the  opportunities  for  choice  are  equal.  This,  of  course, 
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FAMILY  SUE 

PERCENTAGE  OF  FAMILIES  RESIDING  IN: 

Small  Houses 

Medium  Houses 

Large  Hauses 

Large  families 

25.0 

55.0 

20.0 

Medium  families 

30.0 

00.0 

10.0 

Small  families 

30.0 

55.0 

15.0 

requires  a clear  definition  of  what  factors  influence  freedom  of  choice. 
Clearly,  financial  ability  to  buy  or  rent  at  various  levels  is  such  a factor, 
since  size  of  home  is  correlated  with  its  costliness.  A retabulation,  there- 
fore,  such  as  the  following  one  might  then  appear.  This  type  of  find- 


ECONOMIC LEVEL, OF  FAMILY 

PERCENTAGE 

OF  LARGE  FAMILIES 

RESIDING  INI 

Small  Houses 

Medium  Houses 

Large  Houses 

High 

5.0 

35.0 

60.0 

Medium 

35.0 

65.0 

10.0 

Low 

40.0 

55.0 

5.0 

ing  would  tend  roughly  to  support  the  hypothesis  as  restated.  If  similar 
tabulations  were  carried  out  for  "medium”  and  "small"  families,  and  co- 
incided with  the  above  findings,  then  the  hypothesis  would  be  affirmed 
even  more  stiongly. 

However,  economic  ability  is  not  the  only  factor  which  interferes  with 
the  choice  of  a home.  In  some  areas  being  a member  of  a "non-Caucasian" 
race  will  inhibit  choice  through  restrictive  covenants.  Therefore,  it  might 
be  necessary  to  retabulate  the  above  in  terms  of  race.  Thus  such  an  anal- 
ysis would  lead  to  two  racial  divisions  (white  and  nonwhite)  within 
three  economic  levels  (high,  medium,  and  low)  for  three  sizes  of  family 
(large,  medium,  and  small).  This  would  yield  2 X 3 X 3,  or  18  such 
tables.  As  will  be  seen  later,  it  is  not  always  necessary  to  perform  the  task 
in  so  cumbersome  a fashion,  but  the  logical  problem  must  nonetheless  be 
dealt  with. 

If  all  the  relevant  factors  are  known  in  advance,  then  there  is  no  serious 
problem  in  analysis,  for  as  pointed  out  in  the  eaiiier  chapters,  the  experi- 
mental design  would  have  gathered  the  data  in  only  the  crucial  instances. 
There  are  two  aspects  of  scientific  research,  however,  which  make  the 
occurrence  of  so  ideal  a situation  relatively  infrequent,  The  first  of  these 
is  the  appearance  of  an  anomalous  empirical  regularity  or  the  absence  of 
an  expected  regularity.  Such  instances  require  analysis  not  anticipated  by 
the  original  design.  Similar  to  this  is  the  use  of  data,  gathered  for  one 
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purpose,  for  quite  another  problem.  Both  these  represent  what  is  called 
secondary  analyses.  From  a very  “pure"  experimental  point  of  view,  such 
analyses  are  considered  to  yield  answers  which  are  "plausible"  but  not 
capable  of  being  stated  in  the  customary  "probability"  terms  of  science, 

For  example,  in  the  case  of  the  housing  study  referred  to  above,  if  race 
and  economic  status  had  been  considered  in  the  original  design,  the 
sample  employed  would  have  been  so  constructed  as  to  contain  an  ade- 
quate and  representative  selection  from  all  the  various  subtypes.  That  is, 
a satisfactory  sample  of  each  family  size  vithin  each  racial  group  within 
each  economic  level  would  have  been  secured.  If  this  were  not  done  be- 
cause the  later  analysis  was  the  result  of  an  afterthought,  it  becomes  diffi- 
cult to  assess  the  value  of  the  old  sample  for  the  new  purpose, 

However,  secondary  analysis  is  not  only  common  but  necessary,  and  it 
is  certainly  of  great  value,  even  if  it  produces  no  more  than  plausible 
statements  to  serve  as  hypotheses  for  subsequent  verification  in  more 
stringent  terms.1 

In  a sense,  then,  problems  of  data  analysis  involve  all  the  questions 
raised  in  the  chapter  on  research  design,  for  secondary  analyses  do  involve 
the  designing  and  redesigning  of  substitutes  for  the  controlled  experi- 
ment. This  indicates  that  there  are  two  types  of  questions  about  analysis 
which  may  be  raised.  One  of  them  relates  to  the  techniques  of  representing 
the  data , and  the  other  to  the  methods  of  logically  ordering  them  so  that 
questions  can  be  raised  and  answered. 

The  first  of  these  general  problems  cannot  be  treated  exhaustively  in 
a book  such  as  this  and  are,  in  any  case,  part  of  the  usual  content  of 
statistics  courses.  A few  basic  comments  concerning  the  methods  of  repre- 
senting the  data  will  be  made,  but  the  major  object  of  attention  will  be 
the  logical  processes  involved  in  secondary  analysis. 

STATISTICAL  REPRESENTATION 

The  frequency  distribution.  The  simplest  form  of  representing  research 
findings  is  the  frequency  distribution  or  tabulation,  All  that  is  meant  by 
this  is  the  presentation  in  one  column  of  different  qualities  of  an  attribute, 
or  diffeient  values  of  a variable,  together  with  entries  in  another  column 
showing  the  frequency  of  the  occurrence  of  each  of  the  classes.  The  only 
problems  connected  with  the  preparation  of  a useful  frequency  distribu- 
tion, or  simple  table  as  it  may  also  be  called,  is  to  use  common  sense  with 
respect  to  three  things. 

i A detailed  discussion  of  this  point  can  be  found  In  Continuities  in  Social  Research, 
Robert  K.  Merton  and  Paul  F.  Lararsfeld,  eds,  (Glencoe,  III.:  Free  Press,  1950),  passim, 
csp.  pp.  1S3-143  and  pp.  197-2U. 
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First,  the  units  entered  in  the  left-hand  column  describing  the  qualities 
or  values  must  be  mutually  exclusive,  as  well  as  inclusive  of  the  vast 
majority  of  observations  which  will  be  made.  Overlapping  values  or 
attributes  can  only  lead  to  confusion.  For  example,  in  a study  of  Puerto 
Rico  one  of  the  authors  found  that  besides  Catholic,  Protestant,  and  "no 
religion"  categories,  there  were  some  who  professed  Spiritualism.  This 
would  have  made  no  problem  except  for  the  fact  that  it  appeared  possible 
to  combine  Spiritualism  with  either  Protestantism  or  Catholicism.  There- 
fore, instead  of  just  adding  a category,  Spiritualist,  it  was  necessary  to  add 
also  Catholic  and  Spiritualist  as  well  as  Protestant  and  Spiritualist. 

Second , the  tabulation,  to  be  of  the  most  utility,  must  have  internal 
logic  and  order.  It  seems  quite  obvious  that,  if  one  were  tabulating  such 
a variable  as  the  height  of  men  or  the  size  of  cities,  he  would  tabulate  in 
order  either  from  the  tallest  or  largest  to  the  shortest  or  smallest,  or  vice 
versa.  However,  when  tabulating  qualities,  where  the  order  may  not  be 
so  obvious,  the  need  for  a logical  treatment  is  equally  great.  For  example, 
one  survey  asked  women  for  their  reasons  for  buying  a certain  face  cream.* 
One  tabulation  of  the  results  is  shown  in  the  accompanying  table. 


Reasons  for  Buying  Face  Cream  * 

Percentage  of 

Reasons 

Respondents 

Recommendation 

28.0 

Beneficial  to  skin 

21.0 

Heard  it  advertised  over  the  radio 

18.0 

Saw  it  on  the  counter 

15.0 

Reasonably  priced 

10.0 

Scent  appealed 

8.0 

Because  of  special  skin  conditions 

7.0 

Total 

107.0 

• This  is  not  strictly  speaking  a frequency,  but  a percentage  distribution.  This  prob- 
lem will  be  taken  up  later. 

First  of  all,  since  the  total  reaches  107  per  cent,  it  is  clear  that  this  table 
violates  the  first  requirement,  namely,  that  the  categories  be  mutually 
exclusive.  Ignoring  this  fact,  however,  it  is  clear  that  there  is  no  sensible 
order  to  the  types  of  reasons  given.  How  is  it  possible  to  improve  the 
utility  of  the  tabulation  for  analytical  purposes?  Since  there  is  no  con- 
tinuum such  as  that  for  height  or  size,  the  procedure  is  to  search  for 
logical  groupings  of  the  responses.  It  is  possible  to  discover  three  such 
larger  categories,  and  the  use  of  the  table  increases  greatly  if  it  is  modi- 
fied in  this  way. 

* Adapted  from  Hans  Zeisel,  Say  It  with  Figures  (New  York:  Harper,  1947),  2d  ed., 
pp.  7-8. 
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Reasons  for  Buying  Face  Cream 


Reasons 

Percentage  of 
Respondents 

Pertaining  to  respondent: 

28.0 

Beneficial  to  skin 

21.0 

Special  skin  condition 

7.0 

Pertaining  to  product: 

18.0 

Reasonably  priced 

10.0 

Scent  appealed 

8.0 

Pertaining  to  way  heard  of  product: 

ei.o 

Recommendation 

28.0 

Heard  radio  advertising 

18.0 

Saw  it  on  counter 

15.0 

Not  all  tabulations  will  have  a logical  structure,  but  in  most  cases  it  is 
possible  to  find  a logical  order  to  use  as  an  analytical  principle,  even  in 
simple  tabulation. 

Third,  when  the  left-hand  column  of  a tabulation  is  a quantitative 
variable  such  as  size  of  city  or  monthly  rental  value,  the  class  intervals 
must  be  carefully  and  reasonably  chosen.  Schmid  suggests  three  criteria 
for  this:  * 

“1.  Ordinarily  there  should  not  be  less  than  8 or  10  and  not  more  than  18  or 
20  class-intervals,  depending  on  the  nature  of  the  data  and  on  the  number  of 
cases  being  studied.  In  order  to  obtain  a clear  understanding  of  the  original  data 
the  individual  items  are  frequently  arranged  in  either  ascending  or  descending 
order  of  magnitude.  Such  a classification  is  known  as  an  array . After  noting  the 
highest  and  lowest  values  as  well  as  the  characteristic  feature*  of  the  data,  the 
number  of  intervals  can  be  determined  more  easily. 

"2.  Every  effort  should  be  made  to  have  intervals  of  uniform  size.  The  intervals 
should  not  be  so  small  as  to  lose  the  advantages  of  summarization  or  so  large  as 
to  conceal  the  more  important  characteristics  of  the  distribution.  Moreover,  if 
the  dass-iu tervals  are  too  small,  vacant  or  blank  intervals  might  occur.  If  corn- 
pat  isons  are  to  be  made  between  similar  data,  it  is  advisable  to  select  class- 
intervals  of  the  same  size  for  all  the  distributions.  Whenever  possible  the  class- 
intervals  should  represent  common  and  convenient  numerical  divisions  such  as 
5 or  10,  rather  than  odd  divisions  such  as  3 or  7. 

' 3.  After  the  size  of  the  class-intervals  has  been  determined,  it  is  important 
that  they  be  dearly  designated  in  the  frequency  table.  Each  interval  must  have 
definite  lower  and  upper  limits,  and  must  be  expressed  in  such  a way  as  to 
obviate  any  possibility  of  misinterpretation  or  confusion." 

These  seem  like  simple  principles,  and  indeed  they  are.  In  practice, 
however,  they  are  somewhat  difficult,  and  many  analyses  of  data  have 
become  difficult  to  perform  or  understand  because  of  ignoring  them. 

* Calvin  F.  Schmid,  “Basic  Statistical  Concepts  and  Techniques,"  In  Pauline  V.  Young. 
Scientific  Social  Surveys  and  Research  (New  York:  Prentice-Hall,  1949),  pp.  291-292. 
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Summarizing  the  frequency  distribution.  There  are  two  general  ways  of 
stating  a frequency  distribution  in  simple  ways,  These  are  used  singly  and 
together,  depending  upon  the  problem  at  hand.  One  of  them  is  to  compute 
a value  which  represents  the  central  tendency  of  the  distributions.  Such 
measures  are  called  averages  and  include  among  others  the  common  aver- 
age, technically  known  as  the  mean;  the  median,  a value  such  that  halt'the 
entries  in  a frequency  table  fall  below  and  half  above  it;  and  the  mode,  or 
the  value  represented  by  the  greatest  frequency. 

The  other  general  type  of  summary  of  a frequency  distribution  includes 
measures  of  dispersion  such  as  the  standard  deviation  and  the  coefficient  of 
variation . These  measures  are  used  to  compare  the  relative  wideness  of 
spread  in  any  two  or  more  frequency  distributions.  Their  characteristics, 
applications,  and  methods  of  computation  can  be  found  in  any  elementary 
textbook  on  statistics  and  will  not  be  considered  here.  They  are  mentioned 
merely  as  being  widely  used  ways  of  summarizing  frequency  distributions. 

Comparing  frequencies.  A common  and  simple  method  of  comparing 
frequencies  is  the  use  of  the  ratio.  A ratio  is  merely  an  indicated  or  actual 
quotient  which  relates  the  size  of  one  number  to  another.*  Their  chief 
utility  is  to  act  as  a relative  measure  and  thus  permit  the  comparison  of 
otherwise  unequal  numbers.  For  example,  if  we  wish  to  know  the  relation 
of  the  female  to  the  male  death  rates  at  various  ages,  a series  of  ratios,  as 
shown  in  the  accompanying  table,  will  help.  This  adds  to  the  knowledge, 


Deaths  per  Thousand  Native  White  Population  by  Age  and  Sex 
in  Cities  of  100,000  or  more,  Ohio,  1930  • 


Age 

Male 

Female 

Ratio  of  Male 
to  Female 

CM 

17,2 

13.6 

1.26 

5-9 

2.4 

1.7 

1.41 

10-14 

1.5 

1.2 

1.25 

15-19 

2.4 

1.9 

1.26 

45-54 

12.3 

10.0 

1.23 

55-64 

25.8 

19.9 

1.30 

65-74 

59.1 

45.9 

1.29 

75  and  over 

135.6 

114.5 

1.18 

• Adapted  from  Warren  S.  Thompson,  Population  Problems  (New  York*,  McGrawHiU, 
1942),  p.  228. 

shown  by  the  second  and  third  columns,  that  male  death  rates  are  gen- 
erally higher  than  female,  by  showing  that  this  female  advantage  is  con- 
stant through  life,  except  for  being  somewhat  larger  than  usual  at  ages 
5 to  9,  and  somewhat  smaller  at  ages  75  and  over.  The  figures  in  column 

* See  Margaret  Jarman  Hagood,  Stctistics  for  Sociologists  (New  York:  Holt,  1941),  pp. 
113-115. 
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four  are  actual  quotients;  they  could  have  been  expressed  as  172  to  136, 
24  to  17,  15  to  12,  etc.,  but  it  is  the  actual  quotients  which  are  most  useful 
since  they  reduce  the  right-hand  figure  to  1 in  every  case,  thus  allowing 
the  comparison  to  be  made  easily. 

A related  method  of  comparing  values  is  the  proportion.  This  measure 
is  a fraction  such  that  the  numerator  is  one  of  two  observed  frequencies 
and  the  denominator  the  sum  of  observed  frequencies.  Thus  the  previous 
table  could  have  been  expressed  as  the  proportions  of  all  deaths  at  various 
ages  which  were  male  deaths.  Thus  the  ratios  1.26,  1.41,  1.25,  etc.,  ex- 
pressed as  proportions  would  read  0.558,  0.585,  0.555,  etc.  When  propor- 
tions are  expressed  in  multiples  of  100,  they  are  percentages . Thus  in  the 
above  example,  55.8  per  cent  of  all  deaths  in  large  Ohio  cities  in  1930 
between  ages  0 and  4 were  experienced  by  males,  etc.  Whether  ratios, 
proportions,  or  percentages  are  used  in  the  analysis  of  data  is  purely  a 
matter  of  preference  and  of  the  way  in  which  the  researcher  wishes  to  com- 
municate his  find?~g$.6 

The  purpose  of  using  percentages  (and  ratios  and  proportions  as  well)  is 
to  simplify  the  problem  of  comparison.  It  is  important,  therefore,  to  see 
exactly  what  their  use  implies  so  that  they  will  not  be  misused.  First,  they 
can  serve  to  put  qualitative  characteristics  into  numerical  form.  Thus  it 
is  possible  to  compare  two  college  classes  on  the  basis  of  sex  by  saying  that 
one  class  is  60  per  cent  male  and  the  other  only  40  per  cent.  This  is  a per- 
fectly good  comparison,  but  care  must  be  taken  not  to  translate  this  state- 
ment by  saying  that  one  is  more  7nasculine  than  the  other  and  thus  im- 
plying that  masculinity  is  a quantitative  characteristic.  Such  errors  in 
logic  are  common  and  easily  fallen  into,  but  the  consequences  may  be 
disastrous. 

Second,  percentages  reduce  two  frequency  distributions  to  a common 
base,  as  was  illustrated  above,  thus  making  comparisons  much  simpler.  It 
must  be  noted,  however,  that  this  obscures  absolute  comparisons,  and 
sometimes  it  is  these  rather  than  the  relative  comparisons  which  are  im- 
portant. Zeisel  gives  an  example  of  this  problem  6 as  shown  in  the  accom- 
panying table.  Here  the  absolute  and  the  relative  figures  produce  ap- 


Comparative  Growth  of  Cities  M and  N,  1941-1946 


City 

1941 

1946 

Ir  crease 

Percentage 

Increase 

M 

1,000.000 

1,200.000 

200.000 

20.0 

N 

500,000 

650,000 

150,000 

30.0 

6 See  ibid.,  pp. 

115-125. 

• Zeiiel,  op.  of.,  p.  77, 
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parently  opposite  results.  The  problem,  then,  is  which  is  "correct."  Did 
city  Af  grow  mote  or  less  rapidly  than  city  Nf  It  is  clear  that  this  is  a 
problem  in  logic.  Thus  it  could  be  reasoned  that,  if  the  growth  were 
"normal,"  we  would  expect  the  larger  city  to  attract  more  people  than 
the  smaller  and  hence  would  say  the  percentages  are  the  "significant" 
figure.  If,  on  the  other  hand,  the  growth  of  cities  depended  not  upon 
previous  size  but  other  factors  such  as  the  addition  of  new  industries,  then 
we  might  say  that  the  larger  city  grew  faster  in  the  ratio  of  200,000  to 
150,000,  or  one  and  one  third  times  as  fast. 

In  other  words,  while  percentages  are  a great  aid  to  communication 
through  simplification,  they  are  also  susceptible  to  misuse  by  obscuring 
significant  facts.  This  underlines  the  importance  of  never  presenting  a 
table  of  percentages  without  also  showing  the  actual  numbers  which  they 
represent.  This  not  only  guards  against  the  kind  of  misinterpretation  of 
the  kind  described  above,  but  also  assures  the  reader  that  there  is  an  ade- 
quate  base  for  the  calculation  of  a percentage.  For  example,  the  statement 
that  three  of  a total  of  only  four  people  interviewed  favored  the  Demo- 
cratic party  becomes  very  misleading  in  the  form  "three  out  of  every 
four”  or  "75  per  cent”  of  those  interviewed  favored  the  Democratic 
platform. 

Bases  for  computing  percentages.  When  a simple  frequency  distribution 
is  turned  into  percentages  it  is  usually  for  the  purpose  of  making  clear 
the  comparisons  between  the  several  class  intervals.  In  this  case  there  is 
seldom  a problem  as  to  what  the  base  of  the  percentages  should  be. 
One  case,  however,  in  which  a difficulty  may  arise,  is  where  the  tabulation 
includes  residual  categories,  or  more  than  one  dimension.  An  example 
will  show  this  more  clearly.  If  a sample  polled  in  relation  to  a bond  issue 
gave  the  results  shown  in  the  accompanying  table,  there  would  be  a prob- 
lem of  interpretation.  Thus  if  the  base  for  percentaging  were  taken  as  the 

Responses  to  Question,  "Do  You  Favor  the  School  Bond 
Issue  Which  Will  Be  Voted  On  in  November?" 


Responses 

Number 

Yes 

97 

No 

78 

Undecided 

44 

Refuse  to  Answer 

51 

Total 

555 

total  sample,  those  favoring  would  be  reported  as  39  per  cent;  if  the  base 
were  all  those  replying,  the  result  would  be  44  per  cent;  and  if  it  were 
taken  as  those  who  would  answer  and  who  had  made  up  their  minds,  it 
would  be  56  per  cent.  It  is  very  important,  therefore,  that  the  base  of  a 
percentage  be  fully  and  carefully  described  to  avoid  confusion. 
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The  more  common  problem  arises  when  a cross  tabulation  is  used,  fox 
in  this  case,  even  though  the  categories  are  complete,  it  is  still  necessary 
to  make  a choice  as  to  whether  the  percentages  snould  be  computed  hori- 
zontally or  vertically,  Zeisel  gives  a general  rule  which  can  usually  be 
followed,  by  stating7  that  “the  percentages  should  be  computed  in  the 
direction  of  the  causal  factor !*  This  does  not  mean  that  one  of  the  factors 
must  be  the  actual  cause  of  the  other,  but  merely  that  in  the  mind  of  the 
analyst  one  of  them  is  thought  of  as  influencing  the  other.  The  illustration 
gr'en  by  Zeisel  makes  this  clear.  First  of  all,  percentages  may  be  calcu- 


Deaths  from  Cancer  in  the  United  States  by  Race 


RACE 

CAUSE  OF  DEATH 

Cancer 

All  Others 

Total 

White 

159,627 

1,055,804 

1,195,451 

Negro 

9,182 

169,391 

178,575 

Total 

148,809 

1,225,195 

1,574,004 

lated  to  either  the  vertical  or  the  horizontal  totals.  Thus  the  figures  given 

in  the  first  of  the  accompanying  tables  could  be  expressed  in  percentages 

calculated  vertically,  so  as 

to  appear  as  shown  in  the  second  table.  Such 

Race 

Cancer 

All  Others 

Total 

White 

93.8 

86.2 

87.0 

Negro 

6.2 

13.8 

13.0 

Total 

1H0 

100.0 

100.6 

a presentation  shows  the  racial  composition  of  two  groups,  those  dying  of 
cancer  and  those  dying  of  some  other  cause.  This  is  a clumsy  way  of  pre- 
senting the  analysis,  since  we  are  not  likely  to  think  the  cause  of  death  wdl 
affect  materially  the  relative  number  of  whites  and  Negroes.  Rather  it  is 
more  probable  that  the  hypothesis  was  that  for  various  reasons  death  ac- 
companied by  a diagnosis  of  cancer  is  more  common  for  one  group  than 
another.  Therefore  the  percentages  should  be  computed  horizontally,  as 
shown  in  the  third  table.  Such  an  arrangement  points  directly  to  the  de- 


Race 

Cancer 

All  Others 

Total 

White 

11.7 

88.3 

100.0 

Negro 

5.1 

94.9 

100.0 

Total 

10.7 

89J 

100.0 

] » Ibid.,  p.  88. 
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sired  Negro-white  comparison,  rather  than  requiring  still  further  analysis 
as  did  the  other  mode  of  calculating  the  percentages. 

It  will  frequently  be  the  case  that  this  "causal  principle"  will  not  be 
applicable  as  clearly  as  in  the  above  case.  For  example,  suppose  the 
number  of  years  of  school  completed  were  being  cross-tabulated  with 
"economic  worth"  for  a series  of  adults;  would  higher  education  be  con- 
sidered as  the  "cause"  of  higher  economic  standing,  or  vice  versa?  Clearly 
it  could  be  viewed  either  way.  The  general  principle,  then,  is  to  compme 
in  the  direction  of  the  factor  which  the  analyst  wishes  to  emphasize. 

That  the  analyst's  concept  of  causality  or  desire  for  emphasis  is  not 
always  the  sole  determiner  of  the  direction  in  which  percentages  should 
be  computed,  however,  is  shown  by  another  principle  stated  by  ZeiseL* 
This  principle  is  that  percentages  should  be  run  only  in  the  direction  in 
which  a sample  is  representative.  The  illustration  of  this  rule  deals  with 
the  collection  of  political  preferences  from  a representative  sample  of 
8,000  persons  in  each  of  two  states,  with  the  results  shown  in  the  accom- 
panying table.  If  these  results  are  analyzed  from  the  point  of  view  that 


FAVOR  PARTY 

STATE 

Total 

XX 

YY 

A 

2,500 

5,500 

8,000 

B 

3,500 

4,500 

8,000 

differences  between  the  states  are  "causal"  for  differences  in  party  alle- 
giance, then  the  table  would  be  percentaged  horizontally,  as  shown  in  the 
second  table.  Such  results  would  be  "sensible"  in  showing  that  party  YY 


PERCENTAGE  FAVORING  PARTY 

STATE 

Total 

XX 

YY 

A 

31.0 

69.0 

100.0 

B 

44.0 

5*0 

100.0 

was  in  a favorable  position  in  both  states,  but  especially  so  in  state  A . 

If  the  original  question  were  reversed,  however,  a^d  it  w°re  hypothe- 
sized that  the  differences  in  the  parties  rather  than  the  differences  in  the 
states  was  the  "cause"  of  the  distribution,  the  percentages  would  have  been 
calculated  vertically,  as  shown  in  the  third  table.  If  this  table  were  an 

• Ibid.,  p.  101. 
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STATE 

rratENTACt  favoiung  party 

AA 

YY 

A 

42.0 

55.0 

B 

58.0 

45.0 

Tout 

Fooo 

1000 

accurate  analysis  it  would  seem  to  say  that  42  per  cent  of  the  votes  of  party 
XX  would  come  from  state  A.  If  it  is  recalled,  however,  that  the  samples 
were  representative  of  each  state  and  not  of  the  two  parties,  it  is  clear  that 
these  percentages  ate  meaningless.  Since  we  do  not  know  the  relative  con- 
tribution of  the  two  states  to  the  total  vote,  no  such  percentages  can  be 
meaningful.  Therefore  where  crovs  txoulalions  are  representative  in  only 
one  direction,  percentages  can  be  computed  only  in  that  same  direction. 

Presenting  complex  tables.  Several  examples  have  been  given  in  which 
two-dimensional  tables  or  simple  cross  tabulations  have  appeared.  Often 
such  tables  are  not  adequate  for  the  problem  at  band  where  more  than 
two  dimensions  must  be  simultaneously  presented.  The  example  of  the 
si/e  of  houses  given  at  the  beginning  of  the  chapter  is  a case  in  point. 
There  it  was  pointed  out  that  a complete  presentation  would  involve  at 
least  six  tables  placed  side  by  side.  It  is  clear  that  such  a presentation 
would  be  exceedingly  confusing  and  thus  poses  the  problem  of  simplifying 
the  presentation.  Suppose  we  had  only  three  variables  to  deal  with:  si/e 
of  family,  she  of  home,  and  economic  status.  Suppose  further  that  we  are 
provided  with  the  data  shown  in  the  accompanying  table. 


The  Relation  of  Si/e  of  House  to  Si/c  of  Family,  by  Economic  Status 


iNCOMt 

iJCRCt  MVILItS 

MfNt'M  MMtUM 

IM At L rAMTUtt 

Small 

House 

Medium 

House 

L»tge 

House 

Small 

House 

Medium 

House 

Lttge 

House 

Small 

Hous* 

Medium 

House 

House 

High 

5 

55 

60 

5 

TO 

25 

to 

60 

50 

Medium 

25 

65 

20 

to 

65 

25 

<0 

55 

5 

tow 

$5 

55 

to 

)5 

55 

to 

T5 

25 

0 

One  way  of  reducing  the  'able  is  to  remove  one  variable.  Now  any 
dichotomous  percentage  can  be  expressed  by  one  figure  only,  since  the 
second  is  then  determined.  If  we  say  65  per  cent  of  a group  are  men  it  is 
not  necessary  to  state  also  that  55  per  cent  are  women.  Thus  if  it  is  pos- 
sible to  turn  any  of  the  tticholomous  variables  into  dichotomous  ones, 
the  table  can  be  greatly  simplified.  So  suppose  the  analyst's  interest  is  in 
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showing  the  problem  of  crowding,  ami  he  therefore  rewords  the  table  title 
and  reenters  the  data  as  shown  in  the  second  table. 


Percentage  of  Families  of  Various  Sires,  by  Income  Groups, 
Residing  in  Small  Houses 


INCOME. 

Small 

FAMILY  Hit 

Medium 

Urge 

High 

10.0 

10.0 

5.0 

Medium 

40.0 

10.0 

25  0 

Low 

75.0 

so.o 

40.0 

It  would  be  possible  to  conclude  from  these  data  that  income,  more 
than  family  sire,  determined  the  proportion  of  small  houses  occupied. 


SaoI  Matam  tort* 

MMllllS 

fIGt  HE  I.  Graphic  prc^cntitioo  of  a trkbocomous  cross  tabulation  (income  and  sire 
of  families)  for  families  living  in  small  houses. 

Figure  I graphically  illustrates  what  such  a tabulation  actually  does  to 
the  data  by  showing  the  three^Iimensional  character  of  the  cross  tabu- 
lations However,  turning  a trichotomy  into  a dichotomy  is  d ' mil  and 
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sometimes  obscures  the  real  point.  Another  method  ot  summarizing  the 
distribution,  it  will  be  recalled,  is  die  use  of  an  average.  This  table,  if 
the  values  of  "small/'  "medium,"  and  "large"  houses  are  known,  can  be 
simplified  by  the  use  of  an  average  such  as  the  mean,  as  shown  in  the 
third  table. 

Average  Number  ot  Rooms  in  Houses  for  Families  ot  Various  Sires, 
by  Income  Groups 


INCOME 

Small 

FAMILY  SIZE 

Medium 

Large 

High 

4.7 

4.6 

5.7 

Mcdfum 

S.2 

4A 

4.4 

Low 

2.1 

5.4 

5.4 

Again,  the  table  becomes  quite  readable.  The  logic  involved  should  be 
clear.  In  both  cases  the  cell  entries  have  been  changed  from  frequencies 
or  percentages,  representing  every  category  of  the  frequency  distribution, 
into  single  tutues,  each  standing  for  the  enure  distribution. 

General  problems  of  presenting  frequency  distributions.  This  section 
has  been  concerned  primarily  with  showing  some  of  the  ways  of  sum- 
marizing, simplifying,  and  presenting  frequency  distributions.  Some  of  the 
more  common  problems  have  been  spelled  out  and  others  indicated  as 
belonging  to  the  realm  of  statistics.  With  these  in  mind,  the  next  section 
turns  to  some  of  the  logical  problems  underlying  simple  analyses  by  cross 
tabulation. 


LOGICAL  ORDERING  OF  DATA 

The  cross  tabulation  of  two  or  more  attributes  or  variables  is  merely  a 
formal  and  economical  method  of  arranging  the  data  so  that  the  logical 
methods  of  proof  may  be  applied.  Thus,  the  methods  of  agreement,  dif- 
ferences, or  concomitant  variation  (correlation)  may  all  be  used  in  draw- 
ing concl  isions  from  a cross  tabulation.  This  should  further  underline  the 
point  made  earlier  in  this  chapter,  that  what  can  be  gained  through  cross 
tabulation  depends  entirely  on  the  logical  design  of  the  study  and  the 
insighttulness  of  the  investigator.  There  are  no  "tricks"  of  cross  tabulation 
which  can  guarantee  that  an  analysis  will  provide  the  most  significant  and 
meaningful  results  possible. 

It  is  possible,  however,  to  discuss  some  of  the  modes  of  thought  which 
lead  to  fruitful  cross  tabulation.  First  of  all,  the  use  of  cross  tabulation 
is,  in  efTect,  an  approximation  of  the  controlled  experiment.  This  means 
that  the  analyst  is  really  thinking  in  terms  of  cause  and  effect.  That  is,  he 
has  in  mind  one  or  more  variables,  variation  in  which  can  be  used  to 
explain  variation  in  another  variable.  These  "causal"  dimensions  are 
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termed  independent  variables  and  the  values  to  be  explained  are  called 
dependent  variables . It  is  because  the  analyst  is  really  thinking  in  causal 
terms  that  there  is  a need  for  elaborating  survey  results  through  cross 
tabulation.  If  all  that  the  analyst  desires  is  an  empirical  statement  of 
concomitant  variation,  simple  cross  tabulations  will  suffice.  The  need, 
however,  to  know  the  "meaning"  of  a relationship  leads  to  asking  many 
other  questions  which  can  be  answered  only  by  introducing  oilier  vari- 
ables into  the  analysis. 

Kendall  and  La?arsfcld  have  given  a classification  of  three  types  of  elab- 
oration which  they  call  interpretation , explanation,  and  specification^ 

Interpretation * This  is  the  process  of  stratifying  a previous  cross  tabu- 
lation by  another  variable  (called  a test  xmiallt)  under  circumstances 
such  that  the  test  variable  has  occurred  (I)  subsequent  in  time  to  the 
independent  variable,  but  (2)  prior  to  the  dependent  variable.  One  other 
requirement  is  that  the  test  factor  be  correlated  with  both  indc]>endcnt 
and  dependent  variables.  Such  a procedure  allows  the  further  interpreta- 
tion of  an  original  relationship  in  terms  of  a third  (actor,  in  this  case 
called  an  intervening  variable* 

This  may  be  diagrammed  as  follows: 

Itii 

VoricVl 

VoricbU 

O O 

FIGURE  2 

For  example,  when  the  men  students  at  a private  university  were  cross- 
tabulated  by  whether  they  attended  public  or  private  secondary  schools 
and  by  their  university  academic  records,  those  attending  public  schools 
showed  a superior  record.  On  the  face  of  it,  suth  a finding  would  suggest 
an  explanation  of  the  dependent  variable  (grades)  in  teims  of  the  excel- 
lence of  preparation  in  the  independent  variable  (public  versus  private 
secondary  tchools).  Anyone  conversant  with  the  admissions  policy  of  pri 
vatc  universities  might  suspect,  however,  tnat  events  could  have  occurred 
between  secondary  training  and  acceptance  at  the  university  which  might 
give  the  finding  an  entirely  different  meaning. 

Thus  an  intervening  variable  in  terms  of  excellence  of  academic  work 
in  secondary  school  could  logically  be  introduced.  If  there  were  circum- 
stances in  the  application  procedure  which  meant  that  public  school  grad- 
uates had  »o  meet  higher  academic  standards  than  private  school  giad- 
uates,  the  original  correlation  between  the  type  of  secondary  school  and 
subsequent  grades  would  disappear.  In  this  case,  the  lest  factor,  high  school 

* Patricia  t~  Kendall  and  Taut  T.  1 arartfcM.  ‘TroMcnw  of  Ritncy  Analysis/*  Co*- 
tinnitus  in  Soritl  Hnentth,  Robert  K.  Met  ton  and  Taul  *.  LarattfeM,  edi  (Glencoe, 
111.:  Free  Frets,  1950y  The  emite  paper  is  relevant  here,  but  especially  pp.  It  A- ISA. 
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record,  would  sene  to  interpret  the  original  finding  since  when  men  of 
the  same  high  school  achievement  level  are  compared,  the  relation  of  type 
of  school  to  undergraduate  success  would  disappear. 

Explanation.  This  is  essentially  the  same  kind  of  reasoning  as  inter- 
pretation in  that  ii  seeks  to  reduce  <n  originally  observed  correlation 
through  the  use  of  a test  factor.  In  this  case,  however,  the  test  factor  is 
an  antecedent  variable  rather  than  an  intervening  variable.  In  o!her 
words,  an  observed  relationship  is  explained  in  terms  of  a third  factor 
which  occurred  earlier  than  the  independent  variable  and  which  is  related 
both  to  it  and  the  dependent  variable. 

This  may  be  diagrammed  as  follows: 


For  example,  a study  of  birth  rates  in  Puerto  Rico  produced  a cross 
tabulation  which  showed  a negative  correlation  between  economic  status 
and  the  birth  rate.  Both  the  variables  in  this  case  were  measures  of  adult 
life  experience,  and  since  "causal'*  reasoning  was  involved,  the  question 
was  asked  as  to  whether  or  not  there  was  some  variable  characterising  the 
earlier  portion  of  the  life  history  which  might  "explain"  this  correlation. 

Consequently  education  was  used  as  an  antecedent  test  variable,  with 
the  result  that  the  correlation  between  economic  status  and  birth  rates 
all  but  disappeared  when  cross  tabulated  separately  for  comparable  edu- 
cational levels.  In  this  easr(  then,  the  use  of  a test  factor  "explained 
away"  an  original  observation  and  substituted  another  for  it.  We  should 
describe  the  relationship  between  economic  status  and  bitth  rates  in 
Puerto  Rico  as  a spurious  correlation.  This  raises  the  question,  "Hew  can 
see  be  ceitain  that  the  relationship  between  education  and  birth  rates  is 
actually  a causal  one  and  not  merely  another  spurious  correlation)"  The 
only  answer  to  this  is  to  say  that  if  the  application  of  all  known  relevant 
•est  factors  fails  to  reduce  this  correlation  materially,  the  assumption  ol 
causality  can  usually  bo  made. 

Specification.  Whereas  in  explanation  and  interpretation  the  question 
asked  is  whether  or  not  the  use  of  a test  variable  will  cause  the  sharp 
reduction  or  disappearance  of  an  observed  relationship,  the  goal  of  spcci 
fication  is  quite  different.  In  this  case  the  "causal"  sequence  is  irivxlified 
by  specifying  varying  conditions  under  which  the  original  correlation  will 
exist  in  greater  or  lesser  intensity.  The  example  given  by  Kendall  and 
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Lazarsfeld  uses  the  findings  from  The  American  Soldier1*  shown  in  the 
accompanying  table. 

Educational  Level 


High  School 

Les»  than 

Rank 

Graduate 

High  School 

or  Better 

Graduate 

Non-com. 

61.0 

«.o 

Pvt.,  Pfc. 

39.0 

57.0 

Total  case* 

3,222 

3.132 

A crude  measure  of  the  relationship  between  education  and  rank  is 
taken  to  be  the  difference  between  61  per  cent  and  43  per  cent.  This  value, 
called  /,  is  0.18.  Now  the  question  was  asked  whether  an  / value  of  0.18 
would  be  likely  to  be  found  under  all  circumstances,  and  it  seemed  logics* 
that  the  answer  was  "no."  That  is,  opportunities  for  promotion  may  not 
have  been  the  same  at  all  periods  of  World  War  II.  Hence  a cross  tabu- 
lation by  length  of  service  was  made,  as  shown  in  the  second  table, 


Rink 

Hatt  Served  for  Lm 
than  Two  Yean 

Have  Served  for  Two 
Yean  or  More 

High  School 
Graduate 
cc  Better 

Less  than 
High  School 
Graduate 

High  School 
Graduate 
or  Better 

Leu  than 
High  School 
Graduate 

Non-com. 

23.0 

17.0 

74.0 

53.0 

Pvl.,  Pfc. 

77.0 

83.0 

26.0 

47.0 

Total  um 

842 

823 

2,380 

2,329 

/- 

0.06 

/- 

0.21 

Since  the  f%  are  quite  different  for  the  two  length  obsen  ice  categories, 
the  analyst  can  now  state  that  the  relation  between  education  and  rank  is 
greater  for  those  who  entered  the  Army  early  than  for  those  who  entered 
it  later. 

Specification,  then,  is  the  process  of  describing  the  conditions  under 
which  a particular  relationship  may  exist  or  not  exist,  or  may  exist  to  a 
greater  or  lesser  degree.  Like  explanation  and  interpretation,  therefore,  it 
makes  any  "causal"  analyses  more  acceptable  and  meaningful. 

General  aspects  of  elaboration.  There  item  to  be  two  major  problems 
with  respect  to  this  aspect  of  analysis.  The  applications  of  test  factors  in 

iifMI,  pp. 
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the  three  preceding  paragraph*  doubtless  seem  simple  enough.  The  two 
practical  problems,  however,  are  (1)  whether  or  not  to  elaborate,  and 
(2)  what  test  variables  to  apply. 

With  respect  to  the  fust  problem  it  can  be  pointed  out  that  unexpected 
consequences  may  impel  the  analyst  to  elaborate.  If  the  findings  conform 
to  anticipation,  there  is  little  stimulus  for  further  analysis.  This  is  a 
somewhat  dangerous  situation,  for  it  may  result  in  the  acceptance  of 
spurious  or  uninterpreted  relationships.  As  Kendall  and  Lazarsfeld  put  it: 

"Our  first  concern  is  whether  or  not  the  relationship  is  a spurious  one;  conse- 
quently our  initial  efforts  of  elaboration  -re  usually  of  the  explanatory  type. 
Once  we  have  gained  some  assurance  that  the  original  relationship  is  not  a 
spurious  one,  ve  try  to  interpret  the  result  or  to  specify  it.11  We  ask  ourselves  what 
variables  might  provide  the  links  between  the  ’cause*  and  the  ‘effect/  or  what 
conditions  might  show  the  original  relationship  to  bo  even  more  pronounced 
than  we  originally  saw  it  to  be.  The  elaboration  of  a particular  result  can  go  on 
almost  indefinitely.  We  are  limited  only  by  our  lack  of  ingenuity  in  thinking  of 
factors  by  which  to  elaborate  the  result,  by  the  absence  of  data  to  check  the 
relevance  of  factors  which  we  have  thought  of,  or  by  the  difficulties  of  dealing 
with  few*  cases  as  the  process  of  elaboration  is  extended/* 

Elaboration,  therefore,  is  a process  which  is  limited  only  by  the  analyst, 
on  the  one  hand:  his  ability,  his  patience,  and  his  putposes;  and,  on  the 
other  hand,  by  the  nature  of  his  data. 

Unfortunately,  it  is  not  possible  to  give  concise  directions  for  selecting 
the  factors  to  be  used  as  test  variables.  It  should  be  recalled  that  the  seleo 
tion  of  every  test  variable  actually  constitutes  the  formation  of  a hy- 
pothesis which  can  be  tested  by  the  data  at  hand.  Therefore  all  the 
materials  in  the  chapters  on  hypotheses  and  experimental  design  will 
apply  here. 

fn  other  words,  the  ingenuity  of  the  analyst  coupled  with  his  knowledge 
of  tl :•  field  in  which  he  is  wotking  will  be  <he  major  source  of  test 
variables.  Twt)  methods  of  inducing  such  hypotheses,  however,  may  be 
mentioned.  First,  it  would  be  possible  in  some  studies  to  cross  tabulate 
routinely  every  factor  with  every*  other  factor.  In  practice  this  is  usually 
clumsy  and  the  number  can  be  cut  down  substantially  by  reference  to 
existing  theory.  In  any  case,  wide  ranges  of  cross  tabulations  will  often 
suggest  the  selection  of  test  variables.  Second,  the  study  of  deviant  cases 
will  also  often  indicate  the  kind  of  interpretation,  explanation,  or  speci- 
fication which  will  improve  the  analysis. 

In  the  most  general  sense  it  must  be  said  that  the  whole  process  o| 
analysis  is  not  so  much  a matter  of  manipulative  techniques  as  it  is  of  th% 

• •We  must  itress  that  the  failure  of  one  antecedent  «e*t  factor  10  disqualify  the 
causal  connection  between  two  variables  Is  in  no  way  a definitive  demonstration  of 
that  causal  relationship. 
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rigorous  application  of  the  basic  principles  of  scientific  method.  The  re- 
search worker  who  is  fully  acquainted  with  the  problems  of  designing 
research  will  have  fewer  troubles  in  analyzing  his  data* 


CHAPTER  SEVEN 


CLASS  NOTES 

SOME  ILLUSTRATIVE  EXERCISES 

1,  Random  Selection  of  Experimental  and  Control  Groups 

A research  project  is  being  designed  to  test  the  effectiveness  of  a 
program  for  the  dissemination  of  community  information  by  means 
of  a newsletter.  One  group  of  20  households  (the  experimental 
group)  will  receive  the  newsletter  weekly  for  a period  of  six  months. 
Another  group  of  20  households  (the  control  group)  will  not  receive 
the  newsletter  but  will  be  left  exposed  to  the  normal  flow  of 
community  information. 

Select  the  experimental  and  control  groups  from  the  roster  and 
/or  map  of  community  households  provided  in  the  class  notes 
in  Chapter  Four.  Show  the  research  design  in  tabular  form. 

2,  The  Judgment  of  Representativeness 

Are  the  groups  you  have  selected  representative  samples?  Indicate 
the  basis  for  your  answer: 

Are  the  groups  you  have  selected  representative  samples?  Indicate 
the  basis  for  your  answer: 

(a)  Assume  that  a summary  information  roster  is  available. 

(See  the  end  of  the  roster. ) 

(b)  Assume  that  there  is  no  summary  information. 
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3.  The  Validity  of  Data  Collection 

A basic  variable  in  the  research  project  is  soc  o-economic 
status. 

Operationally  define  five  different  ways  to  determine  the  socio- 
economic status  of  the  houreholds. 

4.  The  Assumption  of  Rectilinearity 

One  of  the  hypotheses  to  be  tested  involves  the  relationship 
between  age  of  the  head  of  the  household  and  the  number  of 
hours  of  television  watched  per  week.  The  members  of  the 
research  team  are  uncertain,  however,  whether  the  relation- 
ship would  be  rectilinear  or  curvilinear.  Inspect  the  following 
data  (constructing  a scatter  diagram  if  necessary). 


- 2 - 


TABLE  I 


AGE  OF  HEAD  OF  HOUSEHOLDS  AND  NUMBER  OF 
HOURS  OF  TELEVISION  WATCHED  PER  WEEK 


AGE  NUMBER  OF  HOURS 


23.9 

1.4 

21.3 

2.2 

31.1 

2.6 

25.2 

1.9 

22.5 

2.0 

31.8 

3.4 

25.3 

.6 

31.5 

3.3 

22.9 

1.5 

23.5 

1.25 

30.5 

2.1 

31.2 

3.0 

24.2 

.8 

27.6 

1.0 

31.6 

2.8 

*8.  9 

1.5 

28.5 

.9 

30.6 

2.7 

29.9 

1.7 

29.8 

2.2 

(a)  Is  the  assumption  of  rcctilincarity  justified? 

(b)  Is  the  correlation  of  age  and  hours  of  watching  television 
High}  Lowj  Non-Existent? 


5.  Variables  Into  Attributes 

For  the  purposes  of  analysis,  we  wish  to  simplify  the  data  given  in  #4 
above.  Convert  the  variables  of  age  and  hours  of  watching  television 
into  dichotomous  attributes.  Justify  your  cutting  points. 

6.  Building  Typologies 

In  a study  of  2,200  Individuals  characterised  by  psychiatrists  as  being 
likely  to  attempt  suicide,  it  was  found  that  there  were  1,100  individuals 
who  had  experienced  great  emotional  stress  in  the  recent  past,  and 
53. 6%  of  those  had  tried  to  commit  suicide.  Of  these  not  experiencing 
great  emotional  stress  In  the  recent  past,  46.4%  had  tried  to  commit 
suicide. 

It  was  also  found  that  of  those  individuals  described  as  suffering  from 
social  isolation — there  were  1,100  of  those  too--as  it  turned  out,  590 
tried  to  commit  suicide.  Of  those  individuals  not  suffering  from 
social  isolation,  510  tried  to  commit  suicide. 

Among  those  individuals  who  had  experienced  emotional  stress,  there 
were  1,000  who  had  not  suffered  from  social  isolation  and  500  of  these 
had  attempted  suicide.  Of  all  those  individuals  who  had  suffered  from 
social  isolation  in  the  group  studied,  there  were  1,000  who  had  not 
experienced  emotional  stress  and  500  of  these  attempted  suicide, 
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The  research  team  concluded  that  there  was  very  little  relationship 
between  attempted  suicide  and  emotional  stress  or  social  isolation, 
when  stress  or  isolation  were  present  by  themselves.  When  stress 
and  isolation  were  both  present,  however,  the  probability  of 
attempted  suicide  increased  s.'.arply. 

Present  the  data  bearing  on  this  hypothesis  in  tabular  form. 

7.  Measuring  a Relationship 
(a)  Variables 

A study  of  the  causes  of  delinquency  has  shown  that  both  age  and  income 
of  parents  are  related  to  the  number  of  serious  delinquent  acts  committed 
by  the  individual.  The  question  to  be  settled  at  this  point  is  which  of 
these  two  variables  (age  or  income)  is  most  closely  correlated  with 
delinquency. 

Make  your  decision  on  the  basis  of  the  following  data: 

(1)  The  total  variation  in  the  number  of  delinquent  acts  equals  20. 

(2)  The  total  of  the  variation  in  the  number  of  delinquent  acts 
explained  on  the  basis  of  income  equals  16. 

(3)  The  total  of  the  variation  in  the  number  of  delinquent  acts 
explained  on  the  basis  of  age  is  14. 

Is  the  larger  correlation  coefficient  (r): 
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Very  low  or  negligible 
Low,  present  but  slight 
Substantial  or  marked 
High  or  very  high 


(.00  + .20) 

(+  .20  to  + .40) 
(+  .40  to  + . 70) 
(+ .70  to + 1.00) 


(b)  Attributes 

Using  Yule's  Q (Q  = ad  - be  ),  Measure  the  association  in  the 

ad  + be 

following  two  tables: 

Which  table  shows  the  higher  correlation? 

(Note:  Show  your  calculations) 


TABLE  I 

INCOME 


High 

Low 

Age  25  - 45 

20 

10 

46  or  more 

10 

20 
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TABLE  2 


INCOME 


High 

Low 

Race  Negro 

15 

30 

White 

30 

15 

8.  Testing  for  Significance  of  Difference 

A community  project  to  reduce  unemployment  is  being  based  in  part 
on  a study  of  the  job  histories  of  workers  at  a number  of  large  manu- 
facturing companies  in  the  area.  The  data  show  that  the  mean  age  of 
male  workers  who  have  been  fired  in  the  recent  past  is  31.  97;  the 
mean  age  of  female  workers  fired  is  33.  39. 

The  data  are  based  on  a random  sample  of  713  unemployed  men  and  a 
random  sample  of  287  unemployed  women.  The  immediate  question 
is  whether  *he  difference  of  1.  42  years  in  the  mean  age  of  male  and 
female  workers  who  have  been  fired  is  simply  due  to  sampling 
vagaries  or  represents  a real  difference  in  the  populations  of  male 
and  female  workers  which  might  have  to  be  taken  into  account  in  the 
community  project. 
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(a)  Slate  the  null  hypothesis  in  full. 

(b)  Test  your  hypothesis  in  light  of  the  following  data: 

Standard  Error  of  the  Difference 
Between  Means  = . S70 

95%  cf  the  normal  distribution  will  fall  within  1.  96  standard 
deviations  above  and  below  the  mean. 

99%  of  the  normal  distribution  will  fall  within  2.  58  standard 
deviations  above  and  below  the  mean. 


(c)  State  your  conclusion  about  your  hypothesis,  with  specific 
reference  to  the  level  of  significance. 

9.  The  Influence  of  an  Intervening  Characteristic 

A survey  of  lawyers  in  private  practice  reveals  that  the  longer  the 
lawyer  has  been  in  practice  the  higher  his  annual  income.  (The 
data  are  given  below. ) 


ANNUAL  INCOME 


Length  of  Practice 

$15, 000  per  Year 

More  than  $15,000 

or  Less 

per  Year 

0-10  Years 

700 

300 

11  Years  or  More 

510 

690 

It  is  hypothesized  that  a major  rcaoun  for  this  association  between  length 
of  practice  and  income  is  that  the  longer  a lawyer  is  in  practice,  the  more 
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likely  he  is  to  become  a partner  in  a firm — and  such  partners  will  tend 
to  have  higher  incomes  than  non-partners. 


The  necessary  information  is  given  below: 


Length  of  Practice 

Annual  Income 

Partner 

Total  Number 
of  Lawyers 

11  Years  or  More 

More  than  $15,000 

Yes 

640 

11  Yea^s  or  More 

More  than  $15, 000 

No 

50 

11  Years  or  More 

$15,000  or  Less 

Yes 

160 

11  Years  or  More 

$15,000  or  Less 

No 

350 

0-10  Years 

More  than  $15, 000 

Yes 

200 

0-10  Years 

More  than  $15,  000 

No 

100 

0-10  Years 

$15,000  or  Less 

Yes 

100 

0-10  Years 

$15,000  or  Less 

No 

600 

10.  The  Influence  of  an  Antecedent  Characteristic 

A team  of  participant  observers,  working  with  a government  agency, 
have  come  to  the  conclusion  that  executives  who  are  best  able  to 
handle  criticism  from  those  who  supervise  them  also  tend  to  be  execu- 
tives who  get  along  best  with  their  subordinates. 

(The  data  age  given  below. ) 


Ratings  of  Executives  on  Responses  to 
Criticism 


Ratings  of  Executives  on 

Relations  with  Subordinates 

Good 

Poor 

Total 

Good 

50 

40 

90 

Poor 

40 

50 

90 
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It  is  hypothesized  that  the  observed  relationship  between  the  two  ratings  is 
due  in  part  to  a management  training  program  which  was  provided  for  one- 
half  of  the  group.  That  is  to  say,  the  "good"  ratings  on  both  characteristics 
are  caused  by  the  management  training,  but  outside  of  that  there  is  no 
inherent  connection  between  them. 

The  information  necessary  to  test  this  hypothesis  is  given  below: 


Management  Training  Program  Relations  with  Reactions  to 

Subordinates  Criticism 


Yes 

Good 

Good 

40 

Yes 

Good 

Poor 

20 

Yes 

Poor 

Good 

20 

Yes 

Poor 

Poor 

10 

No 

Good 

Good 

10 

No 

Good 

Poor 

2C 

No 

Poor 

Good 

20 

No 

Poor 

Poor 

40 

Indicate  if  the  hypothesis  is  correct,  using  the  necessary  tables  to 
substantiate  your  conclusion. 


Is  there  any  correlation  between  ratings,  with  exposure  to  the 
management  training  program  held  constant  ? 
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11.  Specification  of  Conditions 


I.  A study  has  been  made  which  indicates  that  families  presently  on 
welfare  tend  to  be  more  permissive  in  child-raising  than  families 
which  were  on  welfare  two  years  ago.  (The  data  are  given  below. } 

Permissive  Child-Rearing  Practices 


Yes 

No 

Total 

Families 

Now  on  Welfare 

180 

120 

300 

Families 

on  Welfare  2 Years  Ago 

120 

180 

300 

The  data  indicate,  however,  that  families  now  on  welfare  differ 
from  families  on  welfare  two  years  ago  in  terms  of  the  age  of 
the  head  of  the  household;  and  since  age  of  the  head  of  the 
household  may  be  related  to  child-rearing  practices,  it  has  been 
suggested  that  the  age  of  the  head  of  the  household  be  held 
constant  in  making  comparisons. 

When  age  is  held  constant,  is  the  proportion  of  families  with 
permissive  child-rearing  practices  among  those  families  now 
on  welfare  greater  than,  less  than,  or  equal  to  the  proportion 
among  those  families  on  welfare  two  years  ago? 

The  necessary  information  is  given  below: 
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Permissive  Child-Rearing  Practices 


Yes 

No 

Total 

Families  Now  on  Welfare 

Age  of  Head 

20-24 

72 

18 

90 

25-29 

88 

22 

110 

30-34 

12 

48 

60 

35-40 

8 

32 

40 

Families  on  Welfare  2 Years  Ago 
Age  of  Head 
20-24 

36 

9 

45 

25-29 

44 

11 

55 

30-34 

22 

88 

110 

35-40 

18 

72 

90 

If  families  now  on  welfare  had  the  same  age  distribution  as  families  on 
welfare  two  years  ago,  what  proportion  of  families  now  on  welfare 
would  have  permissive  child-rearing  practices  ? 


II.  In  examining  the  relationship  between  race  and  perception  of 

occupational  opportunities,  it  is  found  that  there  is  no  difference 
between  the  fwo  groups  in  the  communHy  studied.  (The  data 
are  given  below. ) 

Race  Perception  of  Occupational  Opportunities 


Many  Opportunities 

No  Opportunities 

Total 

White 

160 

340 

500 

Negro 

160 

340 

500 
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The  data  indicate,  however,  that  the  two  random  samples  of  whites  and 
Negroes  differ  with  respect  to  individuels1  belief  In  the  availability  of 
blue  collar  jobs.  Since  belief  in  the  availability  of  blue  collar  jobs  may 
influence  the  individual's  perception  of  occupational  opportunity,  it  has 
been  suggested  that  the  belief  in  availability  of  blue  collar  jobs  be  held 
constant  in  making  comparisons. 

When  belief  in  the  availability  of  blue  collar  jobs  is  held  constant,  do 
Negroes  and  whites  differ  with  regard  to  their  perception  of  occupational 
opportunity  ? 

The  necessary  information  is  given  below: 

Perception  of  Occupational  Opportunities 


Many  Opportunities 

No  Opportunities 

Total 

Negroes 

Blue  Collar  Jobs  Available 

80 

20 

100 

Blue  Collar  Jobs  Not  Available 

80 

320 

400 

Whites 

Blue  Collar  Jobs  Available 

80 

320 

400 

Blue  Collar  Jobs  Not  Available 

80 

20 

100 

If  Negroes  were  distributed  in  the  same  fashion  as  whites  with  respect  to 
belief  in  the  availability  of  blue  collar  jobs,  what  proportion  of  Negroes 
would  believe  that  there  were  many  occupational  opportunities? 
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